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Acid Q-Aiaylase 
Field o f the Inventiion 

The present invention is directed to purified 
genetic and protein sequences encoding the acid a- 
amylase protein of B. acidocaldarius , recombinant 
constructs and hosts transformed therewith that are 
capable of expressing such sequences^ and the use of 
such genetic sequences for the expression of acidic 
aamylase, and the use of the recombinant ly-produced 
acid a-amylase protein for industrial and agricultural 
applications such as, for example, starch 
liquefaction, and silage preparation* 

Brief Descriptio n of the BackarQur^i;^ A-rf- 

Lactobacillus and Bacillus strains offer many 
potential advantages in the production of cloned gene 
products, as compared with Escherichia coli. First, 
they are non-pathogenic and do not synthesize 
endotoxins- Second, many of the gene products are 
secreted into the growth medium, in contarast to E. 
coli, which retains most of the proteins due to the 
presence of an outer membrane. Third, they have been 
widely used for production of enzymes in large-scale 
fermentation processes. While Lactobacillus is not an 
industrial enzyme producer, it is highly used in the 
food and animal feed processing industry. 

One important highly desirable exoenzyme secreted 
in large^ amounts by Bacilli is a-amylase. a-Amylase 
is used in a variety of industrial applications, for 
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example, starch liquefaction. However, the industrial 

use of this enzyme is limited in many cases by its ^ 

sensitivity to acidic -conditions. That is, most forms 

of the enzyme are not enzymatically stable at acidic ^ 

pH's. 

At least four acidophilic and acid-stable a- 
amylase isozymes have been reported in different 
strains of B. acidocaldarius (Kanno, M. , Agric. Biol* 
Cfaera. 50:23-31 (1986); Buonocore, V. et al., tJ. 
Bacteriol. 228:515 (1976); Boyer, E.W.. et al.. 
Starch/ StarJet 31:166 (1979); and Uchino, F. , Agric. 
Biol. Cbem. 46:7 (1982). An "acidic" a-amylase is 
chciracterized as being relatively stable to heat under 
acidic conditions. For example, the acid a-cimylase's 
produced by Bacillus acidocaldarius strain A2 retains 
more than 79-90% of its activity after 30 min of 
incubation at pH 2-0 (70**C) and at pH 4.5 (90^*0) in 
the absence of substrate (Kanno, M. , Agrxc* Biol. 
Cbem. 50:23-31 (1986) « 

a-Amylase has also been isolated from B. 
amyloliqaBfaciens (Ingle Bt al.^ Adv. Appl. Microbiol • 
24:257-278 (1987)). This enzyme has an M^-value of 
about 50,000 daltons suid has been secpienced (Takkinen 
et almr Biol. Cbem 258: 1007-1013 (1983); Chung et 
al., Biocdiem. jr. 285:387-395 (1980)). The expression 
of this en2:yme in B. subtilis has been reported 
("Expression and Regulation of the Bacillus 
amyloliquBfaciens a-amylase gene in B. suJb tills," 
P.Kallio, Ph.D. dissertation. University of Helsinki, 
1987). 

Acidic amylase sequences are especially desirable 
for industrial purposes, \diere pH values may fall 
(meaning- the medium becomes more acidic) during a • 
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desired reaction due to the natural catalytic action 
of the enzymes therein. For example, a process strain 
that possesses an acid a-amylase activity would be 
valuable for the Lactobacillus strains as lactic acid 
bacteria are widely used for preservation of many 
starch containing raw materials (e.g., cereal grains, 
edible roots and crop residues) . 

Although such approaches and applications have 
been highly desired in the art (Applied Environment. 
Microbiol. 55:2130-2137 (1989)), they have not been 
possible to obtain. The main obstacle was the lack of 
an a-amylase enzyme that is not inactivated by the low 
growth pH of Lactobacillus. So far, no genes of 
bacteirial origin, coding for an acid a-amylase enzyme, 
have been characterized and the isolation of large 
quantities of such acid a-amylases from natural 
sources is relatively expensive.- This has precluded 
the wide-spread adoption of such enzymes in industry. 
Accordingly, a need exists for a cost-efficient 
recombinant source of a highly stable acid a-amylase. 

SUMMARY OP THF! TNVENTION 

Recognizing the importance to the feed industry 
of a host that would be capable of providing an acid- 
stable c-amylase activity during the food preservation 
period, and cognizant of the lack of hosts capable of 
providing such enzymatic activity in the currently 
utilized processes, the inventors have investigated 
the properties of the acid a-amylase. from B. 
acidocaldarius . The inventors desired to generate a 
new IracrtoJbaclIIus host, such host having a novel 
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property that -would be advantageous in the food and 
feed industry- 

These efforts have culminated in the development 
of novel recombinant forms of acid a-amylase, hosts 
expressing such acid a-amylase^ and highly improved 
processes for the preservation of food utilizing such 
hosts. 

According to the invention, there are first 
provided a polynucleotide, such polynucleotide 
providing the coding sequence of an acid a-amylase. 

According to the invention, there are also 
provided recombineuit vectors, such vectors providing 
a host escpressible form of acid a-amylase. 

According to the invention, there eire further 
provided host cells transformed with polynucleotides 
and/or vectors that eire capable of expressing 
recombinant forms of acid a-*£aaylase. 

According to the invention, there are further 
provided methods for producing the genetically 
engineered or recombinant protein acid a-amylase of 
the invention using such hosts. 

According to the invention, there are further 
provided methods for the utilization of the acid a- 
amylase enzyme of the invention « 

PESCRIPTION OF THE FIGURES 



Figure 1. SDS-PAGE of the (NH^)2S0^ precipitated 
proteins. L2uies 1 and 2 represent Coomassie Brilliant 
Blue R stained gel and zymogram, respectively. The 
arrows show the positions of the bands corresponding * 
amylase *^ activity. Molecular weight markers 

(Phsanaacia) are indicated on the left. • . 



wo 92/19744 



PCr/FI92/00138 



-5- 

Figure 2. Effect of pH on the activity of B. 
acidocaldarius amylase (s ) . Enzyme activity in 100 jtil 
of the 0.5 M sodium acetate, pH 5.0 eluted proteins of 
B. acidocaldarius cultures was measured using Phadebas* 
amylase test at 60 "C in o.i h citric acid, 0.2 M 
NajHPO^. 

Figure 3. The nucleotide sequence of the B- 
acidocaldarius acid a-amylase gene. The putative 
-35 and -10 regions (bases 126 and 148, respectively) 
of the amylase promoter are underlined. The potential 
signal sequence cleavage site (base 273) is indicated 
by an arrow. The N-terminal sequence (base 1594) of 
the 90 kd protein is underlined with a dotted line. 
The two alternative N-termini of the signal peptide 
are indicated by an asterisk. 

PBSCRIPTION OP THE PRRPR RRED RMROnTMRWS 
I. Definitions 

In the description that follows, a number of 
terms used in recombinant DNA (rDNA) technology are 
extensively utilized. In order to provide a clear and 
consistent understanding of the specification and 
claims, including the scope to be given such terms, 
the following definitions are provided. 

gyocess ptrajp. a "process strain" is a 
bacterial strain that causes or contributes to a 
desired cdiange in a biotechnical process. - 

Bioteehnicsal mrnnt^ntt. a biotechnical process 
is a process that depends upon the production of 
bacteria' or b^omass for the achievement of a desired 
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chemical 1a:ansformat:ion of the process medium, such 

bacteria or biomass providing a component or ^ 

environment necessary for such conversion. 

Fodder • "Fodder" is anything fed to 
domesticated emimals, and especially, coarse food for 
cattle, horses or sheep. 

. Forage. "Forage" is food for domesticated 
animals that is taken by browsing or grazing. 

Silaae . "Silage" is- fodder converted into 
succulent feed for livestock through processes of 
anaerobic acid fermentation (as occurs in a silo) • 

Gene . A DNA secpience containing a template for 
a BNA polymerase. The ENA tremscribed from a gene may 
or may not code for a protein. BNA that codes for a 
protein is termed messenger RNA (mRNA) . 

A "complementcOY DNA" or "cDNA" gene includes 
recombinemt genes synthesized by reverse transcription 
of mHNA and from whicdx intervening sequences 
(introns) have been removed. 

Gene-tic sequence. As used herein, 'the term 
"gene1:ic sequences" is intended to refer to a nucleic 
acid molecule (prefered^ly DNA) • Accordingly, "acid 
amylase genetic sequences" are DNA or lOIA sequences 
that possess a nucleotide sequence that encodes the 
acid a-amylase protein. 

Promoter. The term "promoter" as used herein 
refers to a module or group of modules which, at a 
minimum, provides a binding site or initiation site 
for RNA polymerase action that is sufficient to 
initiate tremscription of the opered^ly linked in a 
desired host. A promoter Is generally composed of ' 
multiple operably linked genetilc elements termed 
herein "modules." » 
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Promoter Modu 3 e . The term "module" as in 
"promoter module" refers to a genetic transcriptional 
regulatory element which provides some measure of 
control oyer the transcription of operably linked 
coding sequences or other operably linked modules. 

Each module in a promoter can convey a specific 
piece of regulatory information to the host cell's 
transcriptional machinery. At least one module in a 
promoter functions to position the start site for RNA 
synthesis. Other promoter modules regulate the 
frequency of transcriptional initiation. Typically, 
modules which regulate the frequency of 
transcriptional initiation are located upstream of 
(i.e., 5« to) the transcriptional start site, although 
such modules may also be found downstream of (i.e., 3\ 
to) the start site. 

The term "target module," as iised herein, refers 
to a transcriptional regulatory element which confers 
the ability to respond to enhancer gene activity 
(i.e.^ such as the protein or peptide encoded by an 
enhancer gene) on a promoter which otherwise would not 
respond, or would respond less efficiently, to such 
enhancer gene activity. 

The term "initiation module" refers to a promoter 
module which is required to initiate transcription of 
operably linked genes with RNA polymerase. In 
prokaryotic promoters, initiation modules are usually 
located at about -lo and -35 nucleotides from the 
start site of transcription. 

By "hybrid promoter" is meant a promoter in which 
an initiation module is operably linked to a 
heterologous target module. A target module which is 
heterologous to an initiation module is a target 
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module which is not found naturally operably linked to 

this initiation module in the host cell. ^ 

Operable linkage. An "operable linkage" is a 
linkage in which a sequence is connected to another ^ 
sequence (or sequences) in such a way as to be capable 
of altering the functioning of the sequence (or 
sequences) • For example, a protein encoding sequence 
which is operably linked to the hybrid promoter of thie 
invention places expression of the protein encoding 
sequence under the influence or control of the regula- 
tory sequence* Two DNA sequences (such as a protein 
encoding sequence and a promoter region sequence 
linked to the 5* end of the encoding sequence) are 
saiid to be operably linked if induction of promoter 
function results in the trcoiscription of the protein 
encoding sequence mRNA and if the nature of the 
linkage between the two DNA sequences does not 
(1) result in the introduction of a frame-shift 
mutation, (2) interfere with the ability of the 
expression regyulatory sequences to direct the express 
sion of the bdRNA or protein. Thus, a promoter region 
would be operably linked to a DNA sequence if the 
promoter were capaJ^le of effecting transcription of 
that DNA sequence* 

Cloning vector , A "cloning vector" is a plasmid 
or phage DNA or other DNA sequence which is able to 
replicate autonomously in a host cell, and which is 
chsoracterized by one or a small number of endonuclease 
recognition sites at which such DNA sequences may be 
cut in a determinsible fashion without loss of an 
essential biological function of the vector, and into 
which DNA loay be spliced in order to bring abotit its 
replication and cloning. The cloning vector may « 
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further contain a marker suitable for use in the 
identification of cells transformed with the cloning 
vector. Markers, for example, are erythromycin and 
kanamycin resistance. The term "vehicle"' is sometimes 
used for "vector." 

Expression vector. An "expression vector" is a 
vector similar to a cloning vector but is capable of 
expressing a structural gene which has been cloned 
into the expression vector; after transformation of 
the expression vector into a host. In an expression 
vector, the cloned structural gene (any coding 
sequence of interest) is placed under the control of 
(i.e., operably linked to) certain control sequences 
which allow such gene to be expressed in a specific 
host. In the expression vector of the invention, a 
desired structural gene is operably linked to the 
hybrid promoter of the invention. Expression control 
sequences will vary, and may additioxially contain 
transcriptional elements such as termination sequences 
and/or translational elements such as initiation and 
termination sites. 

The expression vectors of the invention may 
fiirther provide, in an expression cassette other than 
the one. providing the hybrid promoters of the 
invention, secpiences encoding a desired enhancer gene. 
In a preferred embodiment, such enhancer gene would be 
the enhancer gene which encodes the protein which 
regulates the target module of the hybrid promoter. 

Eunctional Derivative^ A "functional derivative" 
of a molecule, such as a nucleic acid or protein, is 
a molecule which has been derived from a native 
molecule, and which possesses a biological activity 
(either 'functional or structural) that is substan-. 
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tially similar to a biological activity of the native 

molecule, but not identical to the native molecule. By ^ 

a functional derivative of a nucleic acid sequence 

(herein the "second sequence") that encodes a protein ^ 

(herein a "first" sequence) is meant (1) a nucleic 

acid sequence that does not possess the exact same 

nucleo1:ide sequence as the first sequence but which 

encodes the same amino acid sequence; and, a nucleic 

acid sequence that does not encode the ssune €uaino acid 

secpience but sfhich encodes a protein with biochemical 

properties (such as, for example, enzymatic stability 

in acidic conditions) equivalent to (i.e., the same as 

or slighly different from) the protein encoded by the 

first nucleic acid sequence. 

A functional derivative of a protein is a protein 
that retains a desired function of the "parent" 
protein from which it is derived. A functional 
derivative of a protein may or may not contain post-* 
tremsla-tioneJ. modifications, such as covalently linked 
carbohydra'ke, depending on the necessity of such 
modifica'kions for the performance of a specific 
function. The term "functional derivative" is 
intended to include the "fragments," "variants," or 
"chemical derivatives" of a molecule. 

As used herein, a molecule is said to be a 
"chemical derivative" of another molecule when it 
contains additional chemical moieties not normally a 
part of the molecule. Such moieties may improve the 
molecule's solubility, absorption, biological half 
life, etc. The moieties may alternatively decrease 
the toxicity of the molecule, eliminate or attenuate 
any undesirable side effect of the molecule, etc. 
Hoieties capable of mediating such effects are • 
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disclosed in Remington's Pharmaceutical Sciences 
(1980) . Procedures for coupling such moieties to a 
molecule are well known in the art. 

fragment > A "fragment" of a molecule such as a 
nucleic acid or protein is meant to refer to a mole- 
cule which contains a portion of the complete sequence 
of the native molecule. 

Variant, A "variant" of a molecule such as a 
nucleic acid or protein is meant to refer to a mole- 
cule substantially similar in structure and biological 
activity to either the entire molecule, or to a 
fragment thereof, but not identical to such molecule 
or fragment thereof. A variant is not necessarily 
derived from the native molecule itself. A gene allele 
is an example of what is meant by a variant • nucleic 
acid sequence euid an enzymatic isozyme is an example 
of what is meant by a variant enzyme sequence. Thus, 
provided that two molecules possess a similar 
biological activity or function that characterizes 
molecules as a •family', they are considered variants 
as that term is used herein even if the composition or 
secondary, tertiary, or quaternary structure of one of 
the molecules is not identical to that found in the 
other, or if the sequence of nucleic acid (or amino 
acid residues) is not identical, or if the synthesis 
of one of the variants did not derive from the other. 
Thus the invention intends to include all variants 
(genomic alleles and enzymatic isozymes) of the acid 
a-amylase of the invention. 
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II- Gene-tic Engineering of Acid tt-amvlase 

Prior to the invention, the cloning of the acid 
amylase gene was not thought to be possible due to the 
lack of cross-reacting antibodies, the lack of 
sufficient protein to obtain such antibodies, the lack 
of a relieible probe and the lack of protein sequence 
data from which to derive such probe. 

The available antisera or gene probes of other 
known bacterial amylases (for example, those of 
amylollqaefaciens strain A2 as described by Kstnno, H. 
Agric. Biol. Chem. 50:23-31 (1986), B- licheniformis 
as described by siato, N., Arch. Blochem. Bipbys, 
155:290 " (1973) and B. subtxlxs) does not provide any 
crossreactivity or positive hybridization when tested 
with material derived from acidocaldarius . When the 
amylase preparation from B. amyloliquefacxens was 
analyzed by zymography, it revealed an amylase band of 
molecular weight 150,000-200,000 daltons. This is 
much higher than previously reported for other 
bacterial amylases. Due to the very low initial yield 
and large size of the acid amylase, it was not 
possible to determine the N-terminal secpience of this 
protein by standard protein chemistry methods in order 
to obtain an oligonucleotide probe. 

Thus it was necessao^f to identify clones to the 
B. acidalcarius acid amylase without having availsible 

(1) emtibodies that cross-reacted with the protein or 

(2) Protein or DNA sequence information upon which to 
design a oligonucleotide probe. 

To solve this problem, a new method for the 
identification of acid amyleise protein and clones was 
developed* A 90,000 dalton peptide was found in 
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protein extracts of B. acxdocaldarius , that had a very 
low specific amylase activity- An assumption was made 
that this peptide would be sufficiently related to the 
large acid amylase protein to allow selection of 
clones to the acid amylase protein, using probes 
designed from the sequence of the smaller peptide. 
This was not a risk-free assumption as the low level 
"amylase** activity displayed by the 90,000 dalton 
peptide could easily have been a side activity of a 
different type of enzyme, and not related to the acid 
amylase enzyme of the invention per se. Thus, there 
was no way of confirming that the assumption upon 
which the cloning of the enzyme of the invention was 
based was correct until the entire cloning and 
sequencing was complete. 

The process for genetically engineering the acid 
a-amylase sequences of the invention is facilitated 
through the cloning of genetic sequences which are 
capable of providing specific protein encoding 
sequences. Genetic sequences which are capable of 
providing protein encoding sequences may be derived 
from genomic DNA, synthetic DNA, cloned DNA and 
combinations thereof. The preferred species source of 
the acid a-amylase of the invention is B. 
acidocsLldarius , although any source of an acid a- 
amylase may be used. 

Genetic (protein encoding) genomic DNA will not 
contain introns in prokaryotes, although it may 
contain spacers between transcriptional xmits. As 
outlined below, such genomic DNA may be obtained in 
association with the 5* promoter region and/or the 3* 
transcriptional termination region if desired. 
Further/ such genomic DNA may be obtained in 
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assoc±at;ion with t±ie genetic sequences which encode a 
5' non-translated region of the desired mRNA and/or ^ 
with the genetic sequences which encode the 3' non- 
translated region. To the extent that a host cell can 
recognize the tremscriptional and/or translational 
regulatory signals associated with the expression of 
the mRNA and protein, and to the extent that such 
signals do not impede the expression of the 
recombinant acidic aamylase of the invention, then the 
5' smd/or 3« non-transcribed regions of the native 
gene, and/or, the 5" and/or 3* non-translated regions 
of the xsRNA, may be retained and employed for 
transcriptional and translational regulation* 

The coding sequences for the acid a-amylase of 
the invention is not rearramged by the native 
bacterial host prior to expression in such host. To 
obtain coding sequences for proteins whose genes axe, 
not rearranged prior to expression, genomic DNA caoi be 
extracted and purified from any cell of any host which 
carries the coding secpience, whether or not the cell 
e3q>resses the protein. Such extraction of genomic DNA 
can be performed by means well known in the art (for 
exan^le, see Guide to Molecalar Cloning Techniques^ 
S.It. Berger et al., eds.. Academic Press (1987)). 

Alternatively, nucleic acid sequences that encode 
a desired protein Ccui also be obtained by DNA 
treinscribed from mRNA specific for that protein. mRNA 
can be isolated from any cell which produces or 
expresses the protein of interest and used to produce 
cDNA by mesms well known in the art (for example, see 
Guide to Molecular Cloning Techniques, S.L. Berger et 
al., eds.. Academic Press (1987)). Preferably, the 
mRNA preparation used will be enriched in mRNA coding « 
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for the desired protein, either naturally, by 
isolation from a cells which are producing large 
amounts of the protein, or in vitro, by techniques 
commonly used to enrich mRNA preparations for specific 
sequences, such as for example, gel electrophoresis, 
sucrose gradient centrifugation- 

To prepare DNA for cloning into a cloning vector 
or an expression vector, a suitable DNA preparation 
(either genomic DNA or cDNA) is randomly sheared or 
enzymatically cleaved, respectively. Such DNA can then 
be ligated into appropriate vectors to form a 
recombinant gene (either genomic or cDNA) library. 

A DNA sequence encoding a protein of interest or 
its functional derivatives may be inserted into a 
cloning vector or an expression vector in accordance 
with conventional techniques, including blunt-ending 
or staggered-ending termini for ligation, restriction 
enzyme digestion to provide appropriate termini, 
filling in of cohesive ends as appropriate, alkaline 
phosphatase treatment to avoid undesirable joining, 
and ligation with appropriate ligases. Techniques for 
such manipulations are disclosed by Mahiatis, T. , et 
al.. Molecular Cloning (A Laboratory Manual), Cold 
Spring Harbor Laboratory, second edition, 1989) , and 
are well known in the art. 

Libraries containing clones encoding a desired 
protein or a desired transcriptional regulatory 
element may be screened and a desired clone identified 
by any means which specifically selects for the DNA of 
interest. For example, if a clone to an acid a- 
- amylase is desired, such a clone may be identified by 

any meEuis used to identify acid ce-amylase protein or 
mRNA, including, for example, a) by hybridization with 
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an appropriate nucleic acid probe (s) containing a 
sequence (s) specific for the DNA of this protein, or 
b) by hybridization-selected translational analysis in 
which native mRNA that hybridizes to the clone in 
question is translated in vitro ajad the translation 
products are further characterized, or, c) if the 
cloned genetic sequences are themselves capable of 
expressing mRNA, by inonunoprecipitation of a 
translated protein product produced by the host 
containing the clone* 

Oligonucleotide probes specific for a desired 
protein can be used to identify a desired clone. Such 
probes can be designed from knowledge of the amino 
acid sequence of the desired protein. The sequence of 
amino acid residues in a peptide is designated through 
the use of the commonly employed three-letter or 
single-letter designations. A listing of these three- 
letter and one-letter designations may be found in 
teactbooks such as Biochemistry, Lehninger, A., Worth 
Publishers, Mew York, NY (1970) . As used herein, when 
the amino acid secpzence is listed horizontally, the 
amino terminus is intended to be on the left end and 
the carboxy terminus is intended to be at the right 
end« 

Because the genetic code is degenerate, more than 
one codon may be used to encode a peurticular amino 
acid. The peptide fragments are analyzed to identify 
sec[uences of amino acids that may be encoded by 
oligonucleotides having . the lowest degree of 
degeneracy. This is preferably accomplished by 
identifying sequences that contain amino acids that 
are encoded by only a single codon. 
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Although occasionally an amino acid sequence may 
be encoded by only a single oligonucleotide sequence, 
frequently the amino acid sequence may be encoded by 
any of a set of similar oligonucleotides. Impor- 
tantly, whereas all of the members of this set contain 
oligonucleotide sequences that are capable of encoding 
the seune peptide fragment and, thus, potentially 
contain the same oligonucleotide sequence as the gene 
which encodes the peptide fragment, only one member of 
the set contains the nucleotide sequence that is 
identical to the exon coding sequence of the gene. 
Because this member is present within the set, and is 
capable of hybridizing to DNA even in the presence of 
the other members of the set, it is possible to employ 
the unf ractionated set of oligonucleotides in the same 
manner in which one would employ a single oligo- 
nucleotide to clone the gene that encodes the peptide. 

Therefore, using the genetic code, one or more 
different oligonucleotides can be identified from the 
amino acid secpience, each of which would be capable of 
encoding the desired protein. The probability that a 
particular oligonucleotide will, in fact, constitute 
the actual protein's encoding sequence can be 
estimated by considering abnormal base pairing 
relationships and the frequency with which a 
particixlaa: codon is actually used (to encode a 
particular amino acid) in the host cell. Such "codon 
usage rules" are disclosed, for example, by Lathe, R. , 
et al., J. HolBc. BioL. 1SJ:1-12 (1985). Using the 
"codon usage rules" of. Lathe, a single oligonucleotide 
sequence, or a set of oligonucleotide sequences, that 
contain^ a theoretical "most probable" nucleotide 
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sequence capable of encoding the acid a-amylase of the 
invention is identified. 

In addition to the codon usage rules, 
oligonucleotide design can may utilize the use of 
deoxyinosine at ambiguous codon positions. This 
approach is particularly useful when the required DNAs 
sequence is derived from a poorly characterized 
orgcuiism like B\ acidocaldarius (Tak2ihaski et al^r 
Proc. Natl. Acad. Sci. USA 82:1931-1935 (1985)). 

The suitable oligonucleotide, or set of 
oligonucleotides, that are capable of encoding a 
fragment of the desired gene (or which is 
complement2U7 to such an oligonucleotide, or set of 
oligonucleotides) may be synthesized by means well 
known in the art (see, for example, SynthBsis and 
Application of DNA and RNA, S.A. Narang, ed., 1987, 
Academic Press, San Diego, CA) and employed as a probe 
to identify and isolate the cloned gene by techniques 
known in the art. Techniques of nucleic acid 
hybridization and clone identification are disclosed 
by Uaniatis, T«, et al. (In: Molecular Cloning A 
Laboratory Manual, Cold Spring Heurbor Laboratories, 
Cold Spring Harbor, NY (1982)), and by Hames, B.D., et 
al., in: Nucleic Acid Hybridization^ A Practical 
Approach, TBL Press, Washington, DC (1985)), which 
references are herein incorporated by reference. 
Those members of the above-described gene library 
which are found to be capable of such hybridization 
are then analyzed to detexnoine the extent and nature 
of the sequences which they contain. 

To facilitate the detection of the desired 
encoding- sequence, the above-described DNA probe may 
be labeled with a detectable group- Sucdi detectable 
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group can be any material having a detectable physical 
or chemical property. Such materials have been well- 
developed in the field of nucleic acid hybridization 
and in general most any label useful in such methods 
can be applied to the present invention. Particularly 
useful are radioactive labels, such as ^^P, ^H, ^^C, ^S, 
'^1/ or the like. Any radioactive label may be 
employed which provides for an adequate signal and has 
a sufficient half-life. The oligonucleotide may be 
radioactively labeled by means well-known in the art, 
for example, "nick-translation" and T4 DNA polymerase 
replacement synthesis. 

Alternatively, polynucleotides are also useful as 
nucleic acid hybridization probes when labeled with a 
non-radioactive marker such as biotin, an enzyme or a 
fluorescent group. 

Thus, in summary, the actual identification of 
acid a-amylase peptide sequences permits the 
identification of a theoretical "most probable" DMA 
sequence, or a set of such sequences, capable of 
encoding such acid a-amylase. By constructing an 
oligonucleotide complementary to this theoretical 
sequence (or by constructing a set of oligonucleotides 
complementary to the set of "most probedDle" oligo- 
nucleotides) , one obtains a DNA molecule (or set of 
DNA molecules), capable of functioning as a probe (s) 
for the identification and isolation of clones 
directed to acid a-amylase. 

The above discussed methods are, therefore, 
capable of identifying genetic sequences ^ that are 
i capzible of encoding acid a-amylase, or fragments, 

variants and functional derivatives of acid a-amylase. 
In order to further characterize the cloned acid a- 
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amylase (or its fragment, variant or functional 
derivative) and especially, in order to produce 
recombincint acid a-amylase, it is desirable to express 
the proteins which the sequences encodes. 

III. Expression of Proteins Using th e Expression 

Vectors of the Invention 

Expression of acid a-amylase protein allows the 
identification of clones- that are capable of 
ea^ressing the acid a-amylase protein of the 
invention, or fragments, variants or functional 
derivatives thereof. Chsuracteristics unicpie to acid a- 
amylase that may. be used to identify the acid a- 
amylase protein, fragment, variant or functional 
derivative include the ability to specifically bind 
acid a-amylase antibodies, the ability to elicit the 
production of acid a-amylase antibodies that are 
capable of binding to the native protein, and the 
ability to provide an enzymatic function specific to 
acid a-amylase, su^ as, for example, enzymatic 
stability at an acidic pH at which other "non-acidic" 
a-amylase enzymes are relatively unstable, among 
others. 

In a preferred embodiment, amylase-negative 
bacteri2J. strains are used as hosts. Especially 
preferred are amylase negative mutemts of Bacillus or 
Xr. plantarum (whi^ does not produce amylase by 
definition) • 

To express the acid a-amylase of the invention, 
transcriptional and translational signals recognizable 
by an appropriate host are necessary. Cloned 



wo 92/19744 



PCr/FI92/00138 



-21- 

sequences encoding acid a-amylase, its fragments, 
variants or functional derivatives, obtained through 
the methods described above, and preferably in a 
double-stranded form, may be operably linked to 
sequences controlling transcriptional expression in an 
expression vector. Such sequences may be introduced 
into a host cell to produce recombinant acid a- 
amylase, its fragments, variants and functional 
derivatives. 

In addition to transcriptional signals, it may 
also be advantageous to use trans latibnal and/ or 
secretion signals derived from other bacterial sources 
provided that such translational and secretion signals 
are fxmctional in the chosen host cell. This is 
discussed further infra. 

According to the invention, any prokaryote host 
may be utilized. In a preferred embodiment, a member 
of the BsLclllus or Lactobacillus genera are used as 
the host cell for expressing the desired protein of 
the invention. Such members include £. subtilis, B. 
licbeniformis^ B. amyloliqaefaciBns^ B. polymyxa, B. 
stearotbermophilus, B. theroproteolyticus^ B. 
coagulans^ B. thxiringiensis, B. jnegaterium,. B. cereus^ 
B. natto, and, B. acidocaldarius . In an especially 
highly preferred embodiment, the host cell is 
Lactobacillus • 

Lactobacillus species that are associated with 
silage and are expecially preferred as hosts in the 
processes of the invention are L, plantarum, i. 
brevis, Ir. bucbnerie, L. coryniformis, L. cizrvatus, Ir. 
casei, Xr. £ermentum, L. acidophilus, and L. 
salivarius. 
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A nucleic acid molecule, such as DNA, is said t:o 
be "capable of expressing" a a polypeptide like the 
acid a-amylase of the invention if the nucleic acid 
Bolecule contains expression control sequences that 
provide transcriptional regulatory information and 
such sequences are operably linked to the nucleotide 
sequence which encodes the acid a-amylase. 

The precise nature of the regulatory regions 
needed for gene expression may vary between species or 
cell types, but shall in general include, as neces- 
sary, 5 » non-transcribing and 5 " non-translating (non- 
coding) sequences involved with initiation of tran- 
scription and translation respectively. Especially, 
such 5* non— trsinscribing control sequences will 
include a region which contains a promoter capable of 
functioning in the host cell- 

Expression of a recombinant protein in prokaryo- 
tic hosts requires the use of regulatory regions 
functional in such hosts, and preferably prokaryotic 
regulatory systems. A wide varielry of transcrip- 
tional and translational regulatory sequences can be 
employed, depending upon the nature of the prokaryotic 
host. Preferably, these regulatory signals are 
associated with a particular gene which is capable of 
a high level of esqiression in the host cell. 

If desired, a fusion product of the desired 
protein may be constructed. For example, if the 
genetic sec[uence encoding a desired acid a-amylase (or 
its fragment, veuricUit or functional derivative) does 
not possess a sequence encoding a signal sequence 
functional in a certain host, such signal sequence may 
be operably linked to the desired genetic sec[uence, 
thus allowing secretion of the protein from, or the 
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membrane compartmentalization of the protein in, the 
host cell. Such signal sequences may be designed with 
or without specific protease sites such that the 
signal peptide sequence is amenable to subsequent 
removal. Alternatively, the native signal sequence of 
a protein of the invention may be used. 

In addition to using either a native signal 
sequence or a signal derived from some other bacteria, 
it is often useful to construct a signal sequence 
fusion sequence, utilizing only part of the native 
signal sequence. Potential fusion sites between two 
signal sequences are the hydrophobic regions or the 
helix breaker residue between the hydrophobic region 
and the C-terminal part of the signal seuqence. 
Examples of useful fusion signal sequences are 
described in Applicants' copending application, U.S. 
Application No* 07/377,450, filed July 10, 1989 and 
incorporated herein fully by reference* 

Transcriptional initiation regulatory signals 
that can be operably linked to the proteins of the 
invention can be selected which allow for repression 
or activation, so that expression of the operably 
linked genes can be modulated in a specific manner. 

Where the native expression control sequences 
signals do not function satisfactorily in a desired 
host cell, then sequences functional in the host cell 
may be substituted. 

Once the vector or DNA sequence containing the 
construct (s) is prepared for expression, the DNA con- 
struct (s) is introduced into an appropriate host cell 
by any of a variety of suitable means known in the 
art. In a preferred embodiment, Lactobacillus are 
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transforzned essentially as described in von Wright et 
al., Appl. Bnclron. Microbiol. 5^:2029-2035 (1990). 

To transform a host cell with the DNA constructs 
of the invention many vector systems are available, 
depending upon whether it is desired to insert the 
genetic DNA construct into the host cell chromosomal 
DNA, or to allow it to exist in an extrachromosomal 
form. When it is desired to maintain a vector in an 
extrachromosomal form the vector should provide an 
origin of replication capable of functioning in the 
host. 

Genetically stable transf ormants may also be 
constructed with vector systems, or transformation 
systems, whereby a desired protein's DNA is integrated 
into the host chromosome. Such integration may occur 
de novo within the cell or, in a most preferred 
embodiment, be assisted by treoisformation with a 
vector which functionally inserts itself into the host 
chromosome. For example, such vector may provide a 
DNA. sequence element which promotes integration of DNA 
sequences in chromosomes. In a prefesrred embodiment, 
such DNA sequence element is a sequence homologous to 
a sequence present in the host chromosome such that 
the integration is targeted to the locus of the 
genomic secpxence and teuxrets integration at that locus 
in the host chromosome. 

Cells that have stably integrated the introduced 
DNA into their chromosomes are selected by also 
introducing one or more markers which allow for 
selection of host cells that contain the expression 
vector in the chromosome, for example the marker may 
provide • biocide resistance, e.g., resistance to 
antibiotics, or the like. The selectable marker gene 
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can either be directly linked to the DNA gene 
sequences to be expressed, or introduced into the same 
cell by co-transformation. 

A sequence may also be incorporated into a 
plasmid or other vector capable of autonomous replica- 
tion in the recipient host, kny of a wide variety of 
vectors may be employed for this purpose. Factors 
of importance in selecting a particular vector 
include: the ease with which recipient cells that 
contain the vector may be recognized and selected from 
those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired 
in a particular host; and whether it is desirable to 
be able to "shuttle" the vector between host cells of 
different species. 

In addition to antibiotic selection markers, one 
can (or should, if such antibiotic markers are 
undesirable in a specific process in which the hosts 
will be utilized) use natural selection markers based 
on^ for example, nisin resistance, thymidine 
synthesis, lactose utilization or X-prolyl-dipeptidyl- 
aminopeptidase (Appl. Environ. Microbiol. 57:38-43 
(1991) . 

After the introduction of the vector, recipient 
cells are grown in a selective medium, which selects 
for the growth of vector-containing cells. Expression 
of the cloned gene sequence (s) results in the 
production of the desired protein. This expression 
can tcdce place in a continuous memner in the 
tremsformed cells, or in a controlled maimer. 

The expressed acid a-amylase^ or fragment, 
variant or functional derivative thereof may be 
isolated and purified in accordance with the method 
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described herein and using techniques known in the 
art^ However, in a highly preferred embodiment, no 
isolation or purification of the product is needed as 
the host is directly inoculated into the process 
medium. 

The genetic sequences, protein sequences, vectors 
cuid methods of the invention atB useful in many 
applications of the food and beverage industry. 
Especially, the hosts of the- invention are useful when 
directly inoculated into foodstuff. The lactic acid 
bacteria of the invention may be utilized for the 
preparation of fermented food and beverages as lactic 
acid bacteria play ah essential role in the 
prepcuration of such fermented food and beverages. The 
Lactobacilli of the invention may also be applied in 
silage preparation and as probiotics in human and 
cuiimal health* The hosts containing the sequences and 
methods taught herein, together with, if necesssurf, 
current knowledge of the genetics and biocdiemistry of 
lactic acid bacteria, especially of Lactococci and 
Lactobacillus, may be used for the construction of 
bacterial argicultural process strains that have been 
optimized for different agricultural and food 
preparation applications using the sequences and 
methods of the invention « The genetics of Lactic acid 
bacteria are known in the art and are reviewed, for 
example, in BiochimiB 70:No. 3 and No« 4 (1988) and in 
FEMS Microbiol. Rbv. 87:No. 1 and No. 2 (1990). 

For example, one optimization using the sequences 
and methods of the invention is the addition of 
heterologous acid a-amylase activities, and especially 
hosts e3cpressing such activities, to any process 
strain. * Highly desirable hosts that may be utilized 
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as process strain host microorganisms include B. 
subtilis, B. Ixcheniformis, B. amylolxquBfacxens, B. 
polymyxa^ B. stearothermophilus, B. 
tliBrmoproteolytxcus, B. coagulans, B. thurxngxBiisxs, 
B. megatBrxum, B. cereus, B. natto, and, B. 
acidocaldarxus . 

Especially the acid a-amylase activity of the 
invention would be valuable for these strains as 
lactic, acid bacteria are widely used for preservation 
of many starch containing raw materials (e«g«, cereal 
grains, edible roots and crop residues) . 

Lactobacillus carrying and expressing the acid a- 
amylase gene of the invention, or a fragment, variant 
or functional derivative thereof, would be also 
extremely useful in food and fodder preparation. Such 
expression may occur under the control of the 
homologous (native) regulatory regions or under the 
control of heterologous promoter and/or signal 
sequences, especially those of Lactococcus or 
Lactobacillus origin. 

Additionally, the Lactobacillus hosts of th^ 
invention, carrying the acid amylase of the invention 
are useful for the preservation of feed and fodder 
material rich in starch, e.g. , cereal grains (siliaged 
with crimping) , corn (maize) , maize cobs and alfalfa. 

Another specific application of these strains is 
their use in starter feeds for calves, in milk 
replacers for calves younger than tour weeks (to 
increase digestibility of starch and for probiotic 
action) and in starter feed for piglets. 

Furthermore, these strains can be effectively 
used for^ preservation of material containing cereal 
euid slaughter scraps for fodder use. 
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The examples below are for illustrative purposes 
only and are not deemed to limit the scope of the 
invention. 

EXAMPLES 

Example 1 

Isolation of the Acid a-Amvlase Enzyme 
from Bacillus acidocaldarius 

Bacillus acidocaldarius (B. ac.) strain ATCC 
27009 was grown on agar plates containing (per liter) : 
1 g yeast extract^ p. 2 g (NH^)2S0^, 0.5 g MgSO^-7H20, 
0.25 g CaCl2-2H20, 0.6 g KH^PO^ and 2.5 g maltose, pH 
4.5. To isolate the a-an^lase enzyme, the entire 

surface of the agar plate was inoculated with ac. 
cells and the plates were incubated 3-5 days at 55^C. 
Xhe cultures were then suspended in 0.5 M sodium 
acetate, pH 5.0, and the cells were removed by 
centrifugation at 8000 g for 20 min. The supematant 
was centrifuged again at 40,000 g for 30 min and 
proteins in 50 ml of the clecired growth medium were 
precipitated at 0«C by slow addition of (NH^)2S0^ to a 
f incO. concentration of 70% (w/v) . After allowing the 
precipitate of the proteins to form for 30 min at 0«C, 
the precipitated proteins were collected by 
centrifugation at 10,000 g for 20 min. Xhe pellet was 
dissolved in 3 ml of 20 nfll BisTris, pH 5.8, and 
applied onto a Bio-Gel P-200 (Sigma, Richmond, USA) 
column (1.5 X 45 cti) - The elution was carried out in 
the same buffer. Rapid screening for amylase activity 
in the eluted fraction was performed using plate 
assay. -The assay plates contained 1.5% agar, 0.2% 
starch, 20 mM CaCl^r 50 iSm soditm acetate, pH 5.0. 
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Samples were applied into wells made in agar and^ 
after incixbation for 2-10 hours at 55*»C, the enzyme 
activity was detected by spreading 0.01 M 13/0. OlM KI 
solution onto the plates. The enzyme activity can be 
detected as a halo around the agar well. For 
quantitative assay of the acid a-amylase activity^ the 
Phadebas® amylase test (Pharmacia) was used. 

The amylase containing fractions from Bio-^1 P- 
200 were pooled cuid concentrated by ultrafiltration in 
a No vacel I'^'-Omegacel l*" apparatus ( Filtron , 
Northborough^ Massachusetts, USA). The concentrate 
was rechromatographed in a Superose 12 HR 10/30 column 
(Pharmacia, Uppsala, Sweden) in 20 mM Bis-Tris, pH 
5.8. The amylase containing fractions were 

concentrated as above by ultrafiltration and stored at 
-20*C. 

To detect the amylase activity in SDS- 
polyacrylamide gels, zymography was used (Harris, H. 
et al.. Handbook of Enzyme ElGctropboresis in Human 
Genetics^ North-Holland, Amsterdam, and. Methods in 
Enzymology 22:578 (1971). After SDS electrophoresis^ 
the gel was washed for 15 min in 50 sflf soditm acetate, 
pH 5.0 and put onto an amylase assay plate. After 
incubation at 55 for 10 hours the enzyme activity 
was detected as above. When the amylase (s) produced 
by B. ac. strain ATCC 27009, isolated and partially 
purified as described above, were analyzed by SDS-PAGE 
and zymography, two forms with relative moleculso: 
weights of 90,000 and 160,000 were detected (Figure 
1) . Similar results were obtained when the SDS-PA6E 
preceding zymography was performed under nonreducing 
conditions. This suggests that the 160 kd form was 
not a disulfide linked dimer of the 90 kd form. Under 
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the conditions in zyiaography the 160 Jed form had 
apparently much higher specific activity (Figure 1) - 

To assay the amylase activity after SDS-PAGE by 
zymography, the SDS-gel was first incubated in 20 mM 
CaClj, 50 mM sodium acetate, pH 5.0 for 15 min at room 
temperature, AFter this incvibation, the SDS-gel was 
placed on an agarose gel (3mm think, 1.0% agar, 0.2% 
starch, 20 siH CaCl^, 50 nHf sodium acetate, pH 5.0} . The 
gels were then incubated at 55 oc for 10 hours* To 
reveal the amylase activity, the agarose gel was 
stained by spreading O.OIH I^/O.OIM KI solution onto 
the gel. A clesur halo in the dark background 
indicated the amylase activity. 

In chromatofocusing (Huchthens, T.W. inrProtein 
Purification r J-C. Janson et al., eds*, V.H.c. 
Publishers Inc., New York (1989)) and isoelectric 
focusing (der Lan et al., in: Gel Electrophoresis of 
Proteins, B.D. Hames et al. ,.eds., IRL Press, Oxford) 
followed by zymography both the 160 kd and 90 kd 
polypeptides migrated as a broad peak and band, 
respectively, at pl 4.8. When the pH optimum of the 
enzyme (s) was measured, a symmetric curve with a 
relatively sharp pH optimum at pH 5 was obtained. The 
enzyme activity was about four fold at 60"C to that at 
37'>C (Figure 2) • The above data suggests that the 90 
kd polypeptide might be a degradation product of the 
160 kd protein. 
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N-Terminus Sequencing of tihe Acid a-Amvlase Gene 

To sequence the acid oc-amylase enzyme, SDS-PAGE 
was performed to separate the 90 kd and 160 kd 
polypeptides from the partially purified culture 
supernatant. After SDS-PAGE the gel was treated with 
1 H KCl to visualize protein bands and the band 
corresponding the enzyme activity was excised. From 
the gel the protein was electroeluted using an ISCO 
model 1750 electrophoretic concentrator as described 
in Advanced Methcxis in Protein Microsequence Analysis, 
Wittmann-Liebold, B., Salnikow, J. and Erdman, V.A. , 
eds., pp. 194-206, 1986, Springer-Verlag Berlin 
Heidelberg. Only the 90 kd polypeptide could be 
purified in an amount required for NH^-terminal 
sequencing. For Ml^-terminal amino acid sequence 
suialysis acid amylase was degraded in a gas/pulsed 
liquid sequencer (47. Prot. Cbem. 7:242-273 (1988) ) 
after electrophoretic transfer onto a polyvinylidene 
difluoride membrane Speicher, D.W. intTecbnigues in 
Protein Chemistry, T.E.Hugli, ed. , Academic Press., 
San Diego, CA (1989) . When the 90 kd protein 
electrotransferred onto a polyvinyl difluoride 
membrane was degraded, a single amino acid sequence of 
NHg-Asp-Ile-Asn-Asp-Tyr was obtained. 

For* the cloning of the corresponding acid a- 
amylase gene, eight lysylendopeptidase-cleaved 
peptides of the 90 kd protein were purified as 
follows: After electroelution of the 90 kd protein 
from the gel, the eluate was freeze dried and the 
solid material was redissolved in SO iil of 50 mM 
Tris/HClV pH 9.0. 150 ng of lysylendopeptidase (Wedco, 
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Dallas^ USA) was added and the mixtxire was incubated 
at 30 «C for 18 hours. The resulting peptides were 
separated by reverse phase chromatography on a Vydack 
218 TPB5 (0*46 X 15 cm) column connected to a Varian 
5000 liquid chromatograph. The peptides were elated 
using a linear gradient of acetonitrile (0-60% in 90 
min) in 0.1% trifluoroacetic acid. The peptides were 
sequenced after application on polybrene (2rag) 
pretreated glass fiber filters. The N-terminal 
sequence of the eight pturified peptides derived from 
90 kd. protein are shown in Table 1. Three of the 
eight N-terminal sequences (Table 1 no. 2^ 3 and 4) 
were used to synthetize oligonucleotides. The 
sequences of these oligonucleotides, used for 
screening of the B. ac.-gene library, are shown in 
Table 2. 



TABLE 1 

N-terminal secpiences of the lysylendopeptidase 
cleaved peptides of the 90 kd protein. 



Sequence 

tyr-asp-thr-ala-acp-tyr-phe-lys (SEQ ID Ho. 6] 
leu-v'al-8er-a8p-phG--gln-^herser^he--pro-x-asp--pro-thr^i^ 
(SEQ ID No. 7] 

iJ,e--a8p-pro--gly-phe-gly^t:hr^lii--gIn-asp-x-leu-a8n-leu--val-gl^ 
[SEQ ID Kg. 8] 

hiB-ala--val-ile--t:yr-glu-ile-mBt-pro-asp-pro-phe--tyr-a8n^ 

lle-ala (SEQ ID No. 9] 

leu— a8p-t:yr-*lett->ly8 [SEQ ID No. 10] 

aer--'lea-^ly^al-a8n--thr-leuH:yr^leu-nnet-pro-^al-phe^ [SEQ 
ID Ho. 11] 

phe-gly-aBn-phe*hi8*8er-a8n-gly [SEQ ID No. X2] 
gly-ile-tyr-val-gly^ala [SEQ ID No. 13] 



Peptide 

1 

2 

• 3 

4 

5 
6 

7 
8 
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Table 2 

Oligonucleotides used for screening of the B^ac. 
- a-amylase gene. Oligos 370, 371 and 372 
correspond to peptides 2, 3 and 4, respectively. 
••I" is inosine. 

oligonucleotide Sequence 

370 5« TA GAA GAT IGT IGG GTC III IGG GAA IGA GAA- 

CTG GAA GTC IGA lAC 3» [SEQ ID No. 3] 

371 5» OA GTC CTG CTG IGT ICC GAA ICC IGG GTC GAT 3' 
[SEQ ID No. 4] 

372 5* GO lAT GTT ICC GTT ITA GAA IGG GTC IGG CAT 

lAT CTC ITA lAT lAC I6C GTG 3« [SEQ ID No. 5] 



Example 3 

Cloning of the g. acldocaldarius Acid g-^Amvlase Gene 

ATCC 27009 B. ac. cells . were grown in a liquid medium 
containing [per liter] Ig yeast eactract, 0.2g (NH^}2SO^, 0.5 
g MgS0^,7H20, 0.25 g CaCl2, 0«6g KBjPO^ and ig glucose, pH 4.5 
at 55 ^C until late logarithmic phase. The cells were 
collected and the chromosomal DNA was isolated according to 
Marmur, J". Mol. Biol. 3:' 208-218 (1986)) , except that the 
DNA was dissolved in saline after first ^thcuiol 
precipitation* The chromosomal DNA was partially digested 
with Haelll and after agarose gel electrophoresis a 
fragment population of 4-6 kb wais isolated. The DNA 
fragments were cloned in E. coli using il-gtlO as the vector 
(Amersham) according to manufacturers recommendation. The 
£• ac. gene bsmk in Em coli was screened by plaque 
hybridization using oligonucleotides 370 [SEQ ID No. 3] ; 
371 [SEQ ID No. 4] and 372 [SEQ ID No. 5] (see table 2) .as 
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radioactive probes. After hybridization a positive clone, 
with sui 4«2 kb insert, was found. The insert was subcloned 
in M13 vectors and pBR322 and sequenced either by H13 
method or using direct plasmid sequencing method. 

oaie DNA sequence revealed an open reading frame of 
3532 nucleotides starting from the 5' end of the 4.2 kb 
insert (see Figure 3 and [SEQ ID. No. 1]). Within the open 
reading frame, starting from the codon 342, a deduced amino 
acid sequence DINDY (see Figure 3) can be foimd. This 
sequence corresponds to the N-terminal amino acid sequence 
of the 90 kd protein. The open reading frame downstream of 
codon 342 codes for a 92 kd protein whicdi is in good 
agreement with the purified H^90 kd band. Codon 342 is 
preceded by an open reading frame at the 5' end of the 4.2 
kb clone, and there are no sequences resembling either 
promoter or SD-regions in the near vicinity of the N- 
terminal codon (codon 342) . This strongly suggests that 
the 90 kd protein indeed is a breakdown product of a larger 
polypeptide and the regulatory regions of this polypeptide 
cUre outside of the 4.2 kb insert; Therefore, the B.ac. 
gene beuik was rescreened using a 5* fragment of the 4.2 kb 
clone as a probe. This resulted in a new 5.1 kb clone 
which partially overlapped 5* end of 4.2 kb fragment. The 
organization of these two clones was confirmed by Southern 
blotting from the B.ac. -genome and the joint region was 
* sequenced also directly from a relevant chromosomal PGR- 
figment. The 5.1 Kb clone was sequenced by H13 method. 

The DNA sequence upstream of the 4.2 kb fragment 
revealed that open reading frame of the 4.2 kb fragment 
continued additional 400 codons. The open reading was 
preceded by a laical promoter - SD-region, and the deduced 
N-terminsO. amino acid sequence showed typical features of 
the bacterid signal peptide (see Figure 3) . The combined 
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open reading fr2une codes for a protein of 150 kd which is 
in good agreement with the 160 kd band shown in the SDS- 
PAGE - zymography. The deduced amino acid sequence 
revealed sequences that are typical to other amylotic 
enzymes and surprisingly, a C*-terminus that resembles a 
hydrophobic anchor sequence. 

To combine the two parts of the amylase gene the 
relevant DNA fragments of the 5.1 kb and 4.2 kb clones have 
been ligated in to a pHP13 plasmid vector (/fol. Gen. 
Genetics 209:335-342 (1987)) and transformed in Bacillus 
sxjbtilis. The DNA analysis of the acid amylase/pHP13 
hybrid plasmid confirmed the intact acid amylase gene was 
stably maintained in B. subtilis. This hybrid plasmid was 
designated pKTH2034. 

Example 4 

Expression of the Acid g-Amvlase Gene in Lactobacillus 
Using the clone described herein, the B.ac. a-amylase 
gene is e3q>ressed in Lactobacillus host cells. The method 
is described in detail in Examples V, VI, VII and X, of the 
Applicants* copending Application Serial No. 377,450, filed 
July 10, 1989, the specification of which is incorporated 
herein by reference in its entirety as if at forth in full. 

Briefly, plasmids as exemplified by pKTH 1797, pKTH 
1798, pKTH 1799, pKTH 1801, pKTH 1805, pKTH 1806, pKTH 1807 
and pKTH liB09 contain promoter and secretion promoting 
signals. The promoter and secretion promoting sequences of 
these plasmids can direct the expression of heterologous 
gene and secretion of gene product Gram positive host cells 
such as Bacillus g Lactobacillus and Lactococaus 

To obtain expression of the acid a-an^lase gene and 
secretion - or '^export of the gene product, a host cell such 
as I^actobacillus is transformed with a plasmid comprising 
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-the promolier and secre'tion pro]not:ing sequences described 
above and the gene coding for the nature acid amylsise 
protein described herein. In addition of using a plasmid 
that is able to replicate in Lactobacillus host, a 
nonreplicative plasmid (e.g., pBR322 or p£l94) , beeoring 
random fragments of Lactobacillus chromosome, can be used. 
Transformation with nonreplicative plasmid results in 
integration of the acid amylase gene in the Lactobacillus 
chromosome « 

Xhe host containing the amylase gene is cultured in a 
suitable medium €tllowing escpression of the protein and the 
protein is recovered from the cell wall layer or culture 
medium or the host cell, producing the a-amylase activity, 
is directly used in the reqniired process. The i^ctoJbaclllizs 
host of the invention, modified to e3q>ress the acid^- 
amylase gene, may be utilized for the processing of food 
and feed as in the following examples > 

Example 5 
Processing of Forage 

Forage, for instance grass, is h2ucvested by either a 
ftail harvester or a precision chopper attached to a wagon 
of a t r act or* Preservation solution is added to the 
chopped forage mass during the harvesting in the 
chopper/harvester using equipment designed for this 
purpose. Th preservation solution is^prepared by adding an 
inoculant (a Lactobacillus strain as described above, 
capable of producing the acid amylase of the invention) , to 
a bottle or tank which contains half of its ^ volume as 
lukewarm water. This water should not be so hot as to kill 
the bacteria^ After shaking to evenly distribute the 
inoculant, the rest of the volume of the bottle is filled 
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by lukewarm water. Usually faraers use bottles of 30 
liters or tanks of 200 liters. The inoculant can be either 
in liquid form or as a dried powder of the acid amylase- 
expressing hosts of the invention. The inoculant may be 
prepared in fermentors using whey-based medium and the 
culture is grown to a cell density of approximately lO' 
cells/ml. After the culture has reached this density, the 
cells are collected by centrifugation and lyophilized to a 
final density of 10"-1G^2 cells/cf of dry weight. Twenty 
grams of this lyophilized material or equal amount of cells 
in a liquid concentrate is mixed with 30 liters of water as 
described above. Five liters of this diluted solution is 
applied per ton of forage. 

Alternatively, the prepared preservation solution can 
also be added to the forage mass after the wagon has been 
emptied in the silo in which fodder is ensiled. 

Example 6 ' 
Animal Feed Additive 

A concentrate mixture (powdered, granulated or 
pelleted) is manufactured using standard meuiufacturing 
systems. Each component in the mixture is separately 
weighed and then mixed together before pelleting, for 
instance. An inoculant (as described in Example 5) can be 
added or mixed into the mixture as one of the components! 
The concentrate, including the inoculant, can be fed to 
domiestic animals, sucdi as, for example, cows, calves, pigs, 
piglets, etc." 
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The inoculant: can also be fed separaliely as a 
specially £ormulat:ed powder, t:able1: or a licjuid product. 
!Hie inoculeml: is generally included in a carrier 
ingredient, for instance, skim milk powder. These products 
can be fed to domestic animals directly, as as a component 
together with the other feeds or in the drinking water of 
the animal. 

For each purpose, an inoculant is manuactured so that 
it contains 10"-10^^ cfu/g dry inoculant powder. The 
inoculant may be used in any concentration which is 
necessary to achieve the desired effect. Especially, when 
used as an additive to the anim2J.s* feed, the inoculant is 
used at 0.01-0 .02% in the concentrate mixture. 

The advantage of feeding the inoculsmt of the 
invention to domesticated farm animals such as the above is 
that by letting tlie hosts of the invention colonize the 
intestional tract of the animal, the animal is better able 
to digest the silage or fodder that it is fed, as the acid 
stable amylase secreted by such hosts would better degrade 
the ingested foodstuff. 

All references cited herein are incorporated herein by 
reference* VDiile this invention has been described in 
detail and with reference to specific embodiments thereof, 
it will be apparent to one skilled in the art that various 
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changes and modifications could be made therein without 
departing from the spirit and scope thereof. 
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SEQUENCE ZtZSTXNG 

(1) GENERAL XNFORKATIONs 

(1) APPLICANTS Teija Koivula, 

Hervi Sibakov, and 
Ilkka Palva 

(ii) TITLE OF INVENTION: Acid a-Amylase 
(iii) NUMBER OF SEQUENCES: 13 

(XV) OOimESPONDENCE ADDRESS: 



(A) 


ADDRESSEE: 


Sterne, Kessler, Goldstein & Fox 


(B) 


STREET: 


1225 Connecticut Avenue, N.H. , 




Suite 300 


(C) 


CITY: 


Washington 


(D) 


STATE: 


District of Columbia 


(E) 


COUNTRY: 


United States of America 


(F) 


ZIP: 


20036 



(V) COMPUTER READABLE FORM; 

(A) MEDIUM TYPE: Floppy Disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 
{ D } SOFTKARE s Ascii 

(Vl) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Mlchele Ann Cimbala 

(B) REGISTRATION NUMBER: 33,851 

(C) REFERENCE/DOCKET NUMBER: 1X55.0090000 

(vlil) TELEGOUMUNXCAXION INFORMATION: 

(A) TELEPHONE: (202) 833-7533 

(2) INFORUAXION FOR SBQ ID NO :1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 4140 bases 

(B) TYPES Nucleic Acid 

(C) STRANDEDNESS: Double Stranded 

(D) TOPOLOGYs Linear 

(ii) MOLECULAR TYPES Nucleic Acid 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID MOsls 

CATGT66GTC GCCTCCTCTG CGATGGGACA TCTCACTTAC ATTTTAAGAG TCGAAGGGTG 60 

GGGAAGGGAA CAAGGGTCTT GAGAGAAAAG AAAGGATTC6 GACAATGAAC GATGCAAGAG 120 

CACCATTGAC GT06GGAAGG CTGAGCGTAT AATCCA66AC GACAACGTTA GTGCAATCGT 180 



TTGGACAAGA GGAGGTT6AT GCACGTGAAG T6GAC6GGTG CCGCGAGOCT TGCGACGACC 240 
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ATCGTCATGT CTGCAGCGCT TCCGTTCX5CG GCGTTTGAAA ACACCOGTTC CGTCG6CACG 300 

GAACGCX;TTC TGGCGGCGAC CGCGGCGGAT AGTACTAACXJ CCTOGCCCCC GACCGGGGCG 360 

TCCX;CAGGCG GTGGCACAGC AGGAGAGACC TACTCGAATG COGTGCAGAT TGTGAGCGGC 420 

AATGGCTTTC GCGCTGGAAA CGGGGCCACG GTCGTCCTCG CTGTGAATGG ACTGAACTTG 480 

AACCCCT06A GTTTCAAGGT TGTGGTATOG AATAGCCTCA. ACGGCGTGAT CGAG6TGACA 540 

AGTGATTCOG T6CTAACGGG GCOGAATTOG ATTGCCTTGC ACCTTCCTGC CGGG6ATGAC 600 

GGGCTTGGGG C06GTACGTA GAOGGTGACC GTOGAAAGTG GAGGTGATTC GGTCTCGAOG 660 

CCTTCAG6AC AGGGACTTCA GGTCATGCCT TATAG6A0GG CCGACACCAT TGAGTGGGAT 720 

GGAATCTACA CATCOGATGG C3G0GATGTAC GTGTCAGATC CGAATOCCTC CCCTCGACAA 780 

GAGGTGA06A TCAGCCTTCG AGOGTACAGC GGGAACCTGA CGAAGGTGAT CCTGAATTGC 840 

TGGGACACOG CACAAAACAA GGGTTTCCAG GTOGAGATGT CTCCA6GCA6 AAOGTTTGGA 900 

COGTATCAGC TCTGGTCTGC CACGATCCCC GCTTCAAACG GOGGAAOGAT CTATTATC5GC 960 

TTTGACATAT A06ATGGCAC CAGTTTTGCG TGTCTCTCAG GTGAOGGACT GCACA06TCT 1020 

GACGACATCA ACAACAATTT CCOGTTGCCC GTGGGGACGG TGAOGCTTTC GACACTTGAG 1080 

GCGAATCCC96 GTGATAOGGT GA06GTCTCC GACOCTGTAO GTGACTTOGC OGGAAGCCAG 1140 

GATCAACXSCA ATCACAOGGT GATA0G6TTT GTGAACTOGT GGGGOGAAAC GGC06CGACA 1200 

GTCAATGGGA OGAAOGOGAG CTGGAACA6C GTGCAGTTCA GAGTCCCACA GAGCCTTCCA 1260 

AACGGCTTGT ATCGOGTCGA GATOGACACG GTOGCCAAGG ACGOGGATGG GGTGGTCAAT 1320 

GTCGAATTGG AGAGGAGTGC GGAGCTTATT GTAGGGCCTC TGGC06C6TG GATGCAAGOG 1380 

TATGCACATG ATT06TTTCA GGCGTTCTAC 06AT0GCCTT TOGGAGCCGT GTCGACAGGA 1440 

ACCOCGATCA 06CTTCGCCT G06GGCT00G CTCAGCGTGA AGAGTG06AC GCTTCOCXiTC 1500 

TGGGGGGGAG OGGATGAGTC AGGOGAGATC GACCTGCOGA TGGAGAAGCT CCAAATGT06 1560 

6GAGACQAGT TGGOGCAACA AAO0GG06TG CAGGACATGA ACGACTAGAC 6TGGTGGA0G 1620 

GTGACCKrCC CTG0GG06GA TGTGACGACA COOGGOACGA TGTCGTATCA GTTC3GTGACG 1680 

GAGA06GACSA CT6GOCSAGGT GGTCTACZAC GATGACAAT6 GA6CTCAGCT TGAAGG60CT 1740 

GGCCAGGTTG GGTTGTCTTC C3GA0GGACCG AGCTACCAGA TGAGOGTATA C6AA0GGGGA 1800 

TTTCAGACGC GAGATTGGCT GAAGGA06CC GTGATCTACG AAATGATGCC GGATOGGTTC 1860 

TACAATGGGA ATATOGCCAC GGAGGAGAAT COGAATAOGC AAAAGGGGAT TTATGTAGGG 1920 

GCCGATGGAA OGGAGTCATT AGGGCCCATC GAGTTCCACG AGAACTGGGA CTCGCGGCCC 1980 

TATGATOOGA ATATTCCTCC GTTATCTGAT CCCAAAATTG CCAGTCTGOG AGGCAATGGC 2040 



wo 92/19744 



PCr/FI92/00138 



-42- 

CAAT66AAC21 TTGACTTTTT OGGAGGTGAT TTGCAGGGCIA TC6AGGATAA GCTGGACTAC 2100 

CTGAA6AGOC TIGGAGXCAA TACGCTGTAT CTGATGOCCG TCTTTGAGGC GGAATCCAAT 2160 

CACAAATATG ACACAGCOGA CTATTTCAAG ATTGACCCTG GATTTGGAAC GCAGGAGGAC 2220 

TGGCTGAATC T06XACAGGC TGC6CACG0G AAGGGGTTCC ATATCATTCX CGACGGGGTG 2280 

TTOGAAGATA CCGGAA6TGA CAGOGTATAT TTCAACAAGT TC3GGGAACTT CCACTCCAAC 2340 

GGTGC»TC6C AGGCGTACGTr GAA6AACCAG CCGTCGCTGT OGCCCTACTA CTCGTGGTAC 2400 

6TGTGGACAG GGAACACCTC AAACOCATAC GATTOGTGGT TTCAGATCGA CACGCTCOCA 2460 

crTAC3GGACA CGTOGAACOC CGCCTATCAG CGATTCGTGT ATGGGAGCGA CAACTCAGTC 2520 

60GOGTGTGT G6ATC0G6GA AGGTGCGGAC GGATGGOGCT TGGACXCGGC C6ACAAOGGG 2580 

AATTTCaVACH CGGCATGGTG GGGTGGCTTT GGGCAGGCOG TGAAATCGAT OGATCCCAAC 2640 

GCSIGCXSATCA TOG6C6A6AT CIGGGACAAT GCGACGAATG ACAATGGAAC GGATTGGTTG 2700 

ACX366AT0GA GCTTOGACAG TCTAATGAAC TACCAGTTCC GGAAOGCXX^T GATC3GACTTC 2760 

TTCOGGGGCA GGTAGAAGGA OGGAAAOGTG CAGCACCAOG COGTOGAOGC TGOGGGATTC 2820 

AACGAGGAAC TGATGOGOCT GTACAGTGAA TATOCTCTGC AGTCGTTCZA CTCGATGAT6 2880 

AACCTTGTOG ATTGGGAAGA CACCAT60GG ATCCTGACCA TCTTGGAGAA OGOGCOGGAG 2940 

GCAGGCGAIC rCATOCXSOGCI OCAGCAGGAT GAGTACAGGC GGTCTCCIGC GGCIGAAGAG 3000 

TTGGGGATC3G AGAGGCI62UI GCTIGTATCG GACTTTCAAT TCAGCTTOCJC GGGOGATCCG 3060 

AGCATCTICX AOGGOGAOGA GGCAGGGCTC ACTGGTTATT OGGATCCCCX CAATOGTCGG 3120 

ACCaCaiCOGT GGGAGAAOCA GAATCIOGAT CTOCTGAAOC ACXACCGCAA GCTOGGGGOC 3180 

ATTCGAAAOG OCAAIOCIGT GCITGAGAOG GGGGATTTCA CGCOGITGXA OGCACAGGGC 3240 

AI6GTGTACG GATTTGGAAG GAOCAITCGG AATGGGOGAG ATCTCITCXK; IGTGCCRGOG 3300 

GAGGAIGCCA 06GCCATTGT GGOGATCAtkC AATCAGAACC AAGCIATCAC OGTGACCAXT 3360 

GOGAGGGATG GGACGGTTGC GGAOGGGTCC ACSGATGCTOG ATGAACIGAA CAACCAGTGG 3420 

TACaAGGTGC AGAATGGTGG CATTACACTC AOGCIGCAAT CGTATCftAGG TGCCATTTIG 3480 

GTGAOGCOGA GCGAOGGX?CC GATGGCTTAT CTGCAAGAGG AGGATICICA GAACGAGATT 3540 

GC6TGGA06C CTGTGCAAGG TGOCATOGGT TATCGCGTCT GGAGACAGAA TCOGAATGGA 3600 

CAATGGGTGC CCTTTGGACC TGTGCTTCCT GCCACGGACT T6A6TGTCAC GGTGGAACGC 3660 

GATGCATATG CX5CAAACGTT TGCTGTACAA GCGCTGTTTT CGGCX5TCTGA TCAOGCCCAG 3720 

TCXCOGGTGT CGGC31CCTAA GACGGTATC6 CTTCCCGTCG ATGTGCXZCGC GGTACX5CCTG 3780 

AGXCA6CX»A TCGTTAGTGG TCGTGTGGTT GGAGATCX5TG CGATGGTCTC GATCACGCCG 3840 
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GTTTCAGGCG CXSACGCAGTA TGTGATCTAC CAGAGACAGG G0GA06GATC GTATGCTCOG 3900 

GTCGCGACGG TCTCCACAAG TGGCGATTCC GCAGCTATAG GGGAAGTTCC TGCGCAAGGT 3960 

CCGGCCAACT CGCCTCAOGC GACGATTCGC GTGACAGTGC CCGTACCTGC A6GTTTCTCG 4020 

TCGGTGACCT ACCGCGTGGC TGCGCAAAAC GAAGATGGGC AAGCTGTGAC CAATCCATTG 4080 

ACCXTTATOGC TCT06AAAAA GTGATGCCTC GC6AAAAGGG CTAT066AAT TTTTTCX;AGA 4140 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 amino acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

{ D ) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2s 



Met Hia Val Lys Trp Thr Gly Ala Ala Ser Leu Ala Thr Thr lie Val 
15 10 15 

Met Ser Ala Ala Leu Pro Phe Ala Ala Phe Glu Asn Thr Arg Ser Val 
20 25 30 

Gly Thr Glu Arg Val Leu Ala Ala Thr Ala Ala Asp Ser Thr Asn Ala 
35 40 45 

Ser Pro Pro Thr Gly Ala Ser Ala Gly Gly Gly Thr Ala Gly Glu Thr 
50 55 60 

Tyr Ser Asn Ala Val Gin lie Val Ser Gly Asn Gly Phe Arg Ala Gly 
65 70 75 80 

Asn Gly Ala Thr Val Val Leu Ala Val Asn Gly Leu Asn Leu Asn Pro 
85 90 95 

Ser Ser Phe Lys Val Val Val Ser Asn Ser Leu Asn Gly Val lie Asp 
100 105 no 

Val Thr Ser Asp Ser Val Leu Thr Gly Pro Asn Ser He Ala Leu His 
115 120 125 

Leu Pro Ala Gly Asp Asp Gly Leu Gly Ala Gly Thr Tyr Thr Val Thr 
130 135 140 

Val Glu Ser Gly Gly Asp Ser Val Ser Thr Pro Ser Gly Gin Gly Leu 
145 150 155 160 

Gin Val Met Pro Tyr Thr Thr Ala Asp Thr He Gin Trp Asp Gly He 
165 170 175 
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Tyr Thr Ser Asp Gly Ala Met Tyr Val Ser Asp Pro Asn Pro Ser Pro 
180 185 190 

Gly Gin Glu Val Thr lie Ser Leu Arg Ala Tyr Ser Gly Asn Leu Thr 
195 200 205 

Lys Val He Leu Asn Cy^ ^^H? ^^P Asn Lys Gly Phe Gin 

210 215 220 

Val Glu Met Ser Pro Gly Arg Thr Phe Gly Pro Tyr Gin Leu Trp Ser 
225 230 235 240 

Ala Thr He Pro Ala Ser Asn Gly Gly Thr He Tyr Tyr Arg Phe Asp 

245 250 255 

He Tyr Asp Gly Thr Ser Phe Ala Cys Leu Ser Gly Asp Gly Leu His 
260 265 270 

Thr Ser Asp Asp He Asn Asn Asn Phe Pro Leu Pro Val Gly Thr Val 
275 280 285 

Thr Leu Ser Thr Leu Gin Ala Asn Pro Gly Asp Thr Val Thr Val Ser 
290 295 300 

Asp Pro Val Gly Asp Phe Ala Gly Ser Gin Asp Gin Pro Asn His Thr 
305 310 315 320 

Val He Arg Phe Val Asn Ser Ser Gly Glu Thr Ala Ala Thr Val Asn 
325 330 335 

Gly Thr Asn Ala Ser Trp Asn Ser Val Gin Phe Thr Val Pro Gin Ser 
340 345 350 

Leu Pro Asn Gly Leu Tyr Arg Val Glu He Asp Thr Val Ala Lys Asp 
355 360 365 

Ala Asp Gly Val Val Asn Val Glu Leu Asp Arg Ser Ala Glu Leu He 
370 375 380 

Val Gly Pro Leu Pro Ala Trp Met Gin Ala Tyr Ala His Asp Ser Phe 
385 390 395 400 

Gin Ala Phe Tyr Arg Ser Pro Phe Gly Ala Val Ser Thr Gly Thr Pro 
405 410 415 

He Thr Leu Arg Leu Arg Ala Pro Leu Ser Val Lys Ser Ala Thr Leu 
420 425 ' 430 

Arg Leu Trp Gly Ala Ala Asp Gin Ser Gly Glu He Asp Leu Pro Met 
435 440 445 

Gin Lys Leu Gin M&1: Ser Gly Asp Glu Leu Ala Gin Gin Thr Gly Val 
450 455 460 

Gin Asp He Asn Asp Tyr Thr Trp Trp Thr Val Thr He Piro Ala Ala 
465 470 475 480 

Asp Val Thr Thr Pro Gly Thr Met Trp Tyr Gin Phe Val Thr Glu Thr 
485 490 495 
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Asp Thr Gly Gin Val Val Tyr Tyr Aep Asp Asn Gly Ala Gin Leu Glu 
500 505 510 

Gly Pro Gly Gin Val Gly Leu Ser Ser Asp Gly Pro Ser Tyr Gin He 
515 520 525 

Ser Val Tyr Glu Arg Gly Phe Gin Thr Pro Asp Trp Leu Lys His Ala 
530 535 540 

Val He Tyr Glu He Met Pro Asp Arg Phe Tyr Asn Gly Asn He Ala 
545 550 555 560 

Thr Glu Glu Asn Pro Asn Thr Gin Lys Gly He Tyr Val Gly Ala Asp 

565 570 575 

Gly Thr Glu Ser Leu Gly Pro He Gin Phe His Glu Asn Trp Asp Ser 
580 585 590 

Pro Pro Tyr Asp Pro Asn He Pro Pro Leu Ser Asp Pro Lys He Ala 
595 600 605 

Ser Leu Arg Gly Asn Gly Gin Trp Asn He Asp Phe Phe Gly Gly Asp 
610 .615 620 

Leu Gin Gly He Glu Asp Lys Leu Asp Tyr Leu Lys Ser Leu Gly Val 
625 630 635 640 

Asn Thr Leu Tyr Leu Met Pro Val Phe Glu Ala Glu Ser Asn His Lys 
645 650 655 

Tyr Asp Thr Ala Asp Tyr Phe Lys He Asp Pro Gly Phe Gly Thr Gin 

660 665 670 

Gin Asp Trp Leu Asn Leu Val Gin Ala Ala His Ala Lys Gly Phe His 
675 680 685 

He He Leu^ Asp Gly Val Phe Glu Asp Thr Gly Ser Asp Ser Val Tyr 
690 695 700 

Phe Asn Lys Phe Gly Asn Phe His Ser Asn Gly Ala Trp Gin Ala Tyr 
705 710 715 720 

Leu Lys Asn Gin Pro Ser Leu Ser Pro Tyr Tyr Ser Trp Tyr Val Trp 
725 730 735 

Thr Gly Asn Thr ser Asn Pro Tyr Asp ser Trp Phe Gin lie Asp Thr 
740 745 750 

Leu Pro Leu Thr Asp Thr Ser Asn Pro Ala Tyr Gin Arg Phe Val Tyr 
755 760 765 

Gly Ser Asp Asn Ser Val Ala Arg Val Trp He Arg Glu Gly Ala Asp 
770 775 780 

Gly Trp Arg Leu Asp Ser Ala Asp Asn Gly Asn Phe Asn Thr Ala Trp 
785 790 795 800 

Trp Gly Gly Phe Arg Gin Ala Val Lys Ser He Asp Pro Asn Ala Ala 
805 810 815 
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Ile lie Gly Glu lie Trp Asp Asn Ala Thr Asn Asp Asn Gly Thr Asp 
820 825 830 

Trp Leu Thr Gly Ser Thr Phe Asp Ser Val Met Asn Tyr Gin Phe Arg 
835 840 845 

Asn Ala Val lie Asp Phe Phe Arg Gly Thr Tyr Asn Asp Gly Asn Val 
850 855 860 

Gin His His Ala Val Asp Ala Ala Gly Phe Asn Gin Glu Leu Met Arg 
865 870 875 880 

Leu Tyr Ser Glu Tyr Pro Leu Gin Ser Phe Tyr Ser Met Met Asn Leu 

885 890 895 

Val Asp Ser Gin Asp Thr Met Arg He Leu Thr He Leu Glu Asn Ala 
900 905 910 

Pro Gin Pro Gly Asp Leu Ser Ala Leu Gin Gin Asp Glu Tyr Arg Pro 
915 920 925 

Ser Pro Ala Ala Glu Gin Leu Gly He Glu Arg Leu Lys Leu Val Ser 
930 935 940 

Asp Phe Gin Phe Ser Phe Pro Gly Asp Pro Thr He Phe Tyr Gly Asp 
945 950 955 960 

Glu Ala Gly Leu Thr Gly Tyr Ser Asp Pro Leu Asn Arg Arg Thr Tyr 
965 970 975 

Pro Trp Asp Asn Gin Asn Leu Asp Leu Leu Asn His Tyr Arg Lys Leu 
980 985 990 

Gly Ala He Arg Asn Ala Asn Pro Val Leu Gin Thr Gly Asp Phe Thr 
995 1000 1005 

Pro Leu Tyr Ala Gin Gly Met Val Tyr Ala Phe Ala Arg Thr He Arg 
1010 1015 1020 

Asn Gly Arg Asp Val Phe Gly Val Pro Ala Glu Asp Ala Thr Ala He 
1025 1030 1035 1040 

Val Ala He Asn Asn Gin Asn Gin Ala He Thr Val Thr He Pro Thr 
1045 1050 1055 

Asp Gly Thr Val Ala Asp Gly Ser Thr Met Leu Asp Glu Leu Asn 'Asn 
1060 1065 1070 

Gin Trp Tyr Lys Val Gin Asn Gly Gly He Thr Leu Thr Leu Gin Ser 
1075 1080 1085 

Tyr Gin Gly Ala He Leu Val Thr Pro Ser Asp Ala Pro Met Ala Tyr 
1090 1095 1100 

Leu Gin Glu Glu Asp Ser Gin Asn Glu He Ala Trp Thr Pro Val Gin 
1105 HIO 1115 1120 

Gly Ala He Gly Tyr Arg Val Trp Arg Gin Asn Pro Asn Gly Gin Trp 
1125 1130 1135 
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Val Pro Phe Gly Pro Val Leu Pro Ala Thr Asp Leu Ser Val Thr Val 
1140 1145 1150 

Glu Arg Asp Ala Tyr Ala Gin Thr Phe Ala Val Gin Ala Leu Phe Ser 
1155 1160 1165 

Ala Ser Asp His Ala Gin Ser Pro Val Ser Ala Pro Lys Thr Val Ser 
1170 1175 1180 

Leu Pro Val Asp Val Pro Ala Val Arg Leu Ser Gin Pro lie Val Ser 
1185 1190 1195 1200 

Gly Arg Val Val Gly Asp Arg Ala Met Val Ser He Thr Pro Val Ser 
1205 1210 1215 

Gly Ala Thr Gin Tyr Val He Tyr Glh Arg Gin Gly Asp Gly Ser Tyr 
1220 1225 1230 

Ala Pro Val Ala Thr Val Ser Thr Ser Gly Asp Ser Ala Ala lie Gly 
1235 1240 1245 

Glu Val Pro Ala Gin Gly Pro Ala Asn Ser Pro His Ala Thr lie Arg 
-1250 1255 1260 

Val Thr Val Pro Val Pro Ala Gly Phe Ser Ser Val Thr Tyr Arg Val 
1265 1270 1275 1280 

Ala Ala Gin Asn Glu Asp Gly Gin Ala Val Thr Asn Pro Leu Thr Leu 
1285 1290 1295 

Ser Leu Ser Lys Lys- 
1300 



(2) INFORMATION FOR SEQ ID N0s3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 bases 

(B) TYPE: Nucleic Acid 

(C) STRANOEDNESS: Double Stranded 

(D) TOPOLOGY; Linear 

(ii) HOLECOLAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NOs3s 

TAGAAGATIG TI6GGTCIII IGGGAAIGAG AACTGGAA6T CI6AIAC 47 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: Nucleic Acid 

(C) STRANOEDNESS: Double Stranded 

(D) TOPOLOGY: Linear 
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(11) MOLECOLAR TITPE: Nucleic Acid 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

CA6TCCTGCX GIGTICCGAA ICCIGGGTCG AT 32 

(2) INFQRKATION FOR SEQ ID NO: 5: 

(1) SEQUENCE CBARACTERISTICS: 

(A) LENGTH: 52 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double Stranded 

(D) TOPOLOGT: Linear 

(U) MOLECULAR TYPE: Nucleic Add 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCIATGTTIC GGTTITAGAA IGG6TCIGGCA TIATCTCITA lATIAClGOG TG 52 



(2) INFORKATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 Amino Acids 

(B) TYPE: Amino Add 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(11) MOLECULAR TYPE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TYR ASP TBR ALA ASP TYR PRE LYS 
1 5 8 



(2) INFORMATION FOR SEQ ID NO:7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 Amino Acids 

(B) TYPE: Amino Add 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(11) MOLECULAR TYPE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

LEU VAL SER ASP PHE GLN PHE SER PHE PRO Xaa ASP PRO THR ILE PHE 

1 .5 10 15 

TYR 
17 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 Amino Aclde 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

ILE ASP PRO 6LY PHE GLY THR GLN GLN ASP Xaa LEU ASN LEU VAL GLN 
1 5 10 15 

ALA 
17 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I.ENGTH: 18 Amino Acids 

(B) TYPE: Amino Aeid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

HIS ALA VAL ILE TYR GLU ILE MET PRO ASP PRO PHE TYR ASN GLY ASN 
15 10 15 



ILE ALA 
18 



(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: 5 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:10: 



LEU ASP TYR LEU LYS 
1 5 
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(2) INFORHATION FOR SEQ ID NOslls 

(1) SEQUENCE CHARACTERISTICS: 

(A) I£NGTIi: 15 Amino Acids 

(B) TITPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(11) HOLECDLAR TlTFEs 

(xl) SEQtlENCE DESCRIPTIONS SEQ ID NO: 11: 

SER LEU GLY VAL ASN THR LEO TYR LEU KET FRO VAL PBE GLU ALA 
15 10 15 



(2) INFORMATION FOR SEQ ID N0:12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRimDEDNESS: 

(D) TOPOLOGY: Linear 

(11) MOLECULAR TYPE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

PHE GLY ASN PHE HIS SER ASN GLY 
1 5 8 



(2) INFORMATION FOR SEQ ID NO: 13: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 Amino Acids 

(B) TYPE: Amino Acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



GLY ILE TYR VAL GLY ALA 
1 5 6 
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WHAT IS CIAIMED IS; 



1. Substantially pure acid a-amylase DNA. 

2. The substantially p\ire acid a-aiaylase DNA of claim 1, 
wherein the sequence of said DNA is that of SEQ ID No, 1, or a 
biologically active fragment, variant, or functional derivative 
thereof. 

3. A recombinant DNA molecule, wherein said recombinant 
DNA molecule comprises the DNA of claim 2 • 

4. A cloning vector, wherein said cloning vector comprises 
the recombinant DNA molecule of claim 3. 

5. The cloning vector of claim 4, wherein said recombinsoit 
DNA molecule is capsible of being expressed. 

6. A host cell transformed with the recombinant DNA 
molecule of claim 3. 

7. The host cell of claim 6, wherein said host cell is a 
bacterial host. 

8. The host cell of claim 7, wherein said host cell is a 
Lactobacillus or a Bacillus. 

9. The host cell of claim 8, wherein said host cell is 
Lactobacillus. 
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10. The host cell of claim 8^ wherein said host cell is 
Bacillus. 

11. A method for the production of acid aamylase, wherein 
said method comprises: 

(i) providing a nucleic acid molecule comprising the 
nucleic acid seq of SEQ ID No. 1, or a fragment, variant or 
functional derivative thereof; 

(ii) transforming a host with said molecule; 

(iii) expressing said nucleic acid molecule in said 

host; £Uid 

(iv) isolating the acid a-amylase produced by said 
e3cpression. 

12. The method of claim 11, wherein said host is 
LsLCtobacillas. 

13. The method of claim 12, wherein said host is Bacillus • 

14. A substantially purified B. acidocaldarius acid a- 
amylase, wherein said acid a-amylase comprises the amino acid 
sequence of SEQ ID No. 2, or a biologically active fragment, 
variant or functional derivative therof . 
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ls. The substantially purified acid a-amylase of claim 14, 
wherein said acid a-amylase is produced by a method comprising: 

(i) providing a nucleic acid molecule comprising the 
nucleic acid seq of SEQ ID No. 1, or a biologically active 
fragment, variant or functional derivative thereof; 

(ii) transforming a host with waid molecule; 

(iii) expressing said nucleic acid molecule in said 

host; and 

(iv) isolating the acid • a-amylase produced by said 
expression. 

16. A method for preserving feed or fodder, wherein said 
method comprises the addition of the host of claim 6 to said feed 
or fodder. 

17. The method of claim 16, wherein said host is 
Lactobacillus m 

18. The method of claim 16, wherein said feed is cereal 
grain. 
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B , Acldocaldarius amylase: DNA sequence 

10 20 30 40 50 60 



70 80 90 100 110 120 

G6GAAGGGAACAA666TCTT6A6A6AAAAGAAAGGATTCGGACAAT6AACGAT6CAAG^ 



130 140 150 160 170 180 

CACCATTGACGTCGGGAAGGCTGAGCGTATAATCCAGGACGACAACGTTAGTGCAATO? 



190 200 210 220 230 240 

TTGCAOUVGAGGAGGTTGATGCACGTGAAGTGGACGGGTGCCGCGAGCCTTGCGACGACC 

MetHlsValLysTrpThrGlyAlaAlaSerLeuAlaThrThr 

250 260 270 280 290 300 

ATCGTCATGTCTGCAGCGCTTCCGTTOXIGGCGTTTGAAAACACXXIGTTCCGTCGGCAC^ 
IleValMetSerAlaAlaLeuProPheAlaAlaPheGltiAsnThrArgSerValGlyThr 

310 320 330 340 350 360 

GAACXX:GTTCTGGCG6CX3ACCGCGGCGGATAGTACTAACGCCTCGCCCCC6ACCGGG^^ 
GluArgValLeiiAlaAlaThrAlaAlaAspSerThrAsnAlaSerProProThrGlyAla 

370 380 390 400 410 420 

TCCGCTIGGCGGTGGCACAGCAGGAGAGACCTACTCGAATGCCGTGCAGATTGTG^ 
SerAlaGly61y61yThrAlaGly61uThxfF3rxSerAsnAlaVal6I^ 

430 440 450 460 470 480 

AATG6CTTTCGCGCTGGAAACGGGGCCACGGTCGTCCTCGCTGT6AATG6ACTGAACTTG 
AsnGlyPheArgAlaGlyAsnGlyAlaThrValValLeiiAlaValAsnGlyLeuAsnLeu 

490 500 510 520 530 540 

AACCCCTCGAGTTTCAAGGTTGTGGTATCGAATAGCCTCAACGGCGTGATCGACGTCACA 
AsnProSerSerPheLysValValValSerAsnSerLeuAsnGlyVallleAspValThr 

550 560 570 580 590 600 

AGTGATTCCGTGCTAACGGGGCCGAATTCGATTGCCTTGCACCTTCCTGCCG6GGATGAC 
SerAspSerValLeuThrGlyProAsxiSerlleAlaLeiiHlsLeuProAlaGlyAspAsp 

610 620 630 640 650 660 

GGGCTTGGCGCCGGTACGTACACGGTCACCGTCGAAA6TGGAGGTGATTCGGTCTCGACG 
GlyLeuGlyAlaGlyThrTyrThrValThrValGluSerGlyGlyAspSerValSerThr 

670 680 690 700 710 720 

a;TTCAGGACAGG6ACTTCA6GTCATG0CTTATACGACG6CCGACACCATTCAGTGGGAT 
ProSerGlyGlnGlyLeuGlnValMetProTyrThrThrAlaAspThrlleGlnTxpAsp 

730 740 750 760 770 780 

6GAATCTACACATCCGATGGC6C6AT6TAC6TGTCAGATCCGAATCCCTCCCCTGGACAA 
GlylleTyrThrSerAspGlyAlaMetTyrValSerAspProAsnProSerProGlyGln 
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790 800 810 820 830 840 

GAGGTGACGATCy^GCCTTCGAGOSTACAGCGGKSaVACCTGACGAAGGTGATCXTG^ 
GluValThrlleSerLeuArgAlaTyrSerGlyAsnLeuThrLysVallleLeuAsnCys 

850 860 870 880 890 900 

TGGGACACCGCACAAAACAAGGGTTTCCAGGTCGAGATGTCTCCAGGCAGAACGT 
TrpAspThrAlaGlnAsnLysGlyPheGlnValGluMetSerProGlyArgThrPh^ 

910 920 930 940 950 960 

CCGTATCAGCTCTGGTCTGCaVCGATCX:CCGCTTCAAACGGa3»A^ 
ProTyrGlnLeuTrpSerAlaThrlleProAlaSerAsnGlyGlyThrlleT^ 

970 980 990 1000 1010 1020 

TTT6ACATATACGATGGCACCAGTTTTGCGTGTCTCTCAGGT6ACGGACTGCACACGTCT 
PheAspIleTyrAspGlyThrSerPheAlaCysLeuSerGlyAspGlyLeuHisThrSer 

1030 1040 1050 1060 1070 1080 

GAa»CATCAACAACAATTTCCXXM!T6CXXXITG^^ 

AspAspIleAsnAsnAsnPhePxxjLeuProValGlyThrValThrLeuSerThrte 

1090 1100 1110 1120 1130 1140 

GCGAATCCXX5GT6ATACGGTGACGGTCTCCSGACCCTGTAGGTGACTTCGCCGG 
AlaAsnProGlyAspThrValThrValSerAspProValGlyAspPheAlaGlySerGln 

1150 1160 1170 1180 1190 1200 

GATCAACCCAATCACACGGTGATACGGTTTGTCAACPCGTCGGGOG^ 
AspGlnProAsnHlsThrVallleArgPheValAsnSerSerGlyGluThrAlaAlaThr 

1210 1220 1230 1240 1250 1260 

GTCAATGGGACGAACGCGAGCTGGAAOVGCGTGCAGTTC^CAGTCCCyiCAGAGCCTTC^ 
ValAsnGlyThrAsnAlaSerTrpAsnSerValGlnPheThrValProGlnSerLeuP^ 

1270 1280 1290 1300 1310 1320 

AACGGCTTGTATCGCGTCGAGATCGAOVCCGTCGCCAAGGACGCGGATGGGGTGGTC^ 
AsnGlyLeuTyrArgValGluIleAspThrValAlaLysAspAlaAspGlyValValAsn 

1330 1340 1350 1360 1370 1380 

GTCGAATTGGACAGGAGTGCGGAGCTTATTGTAGGGCCTCTGCCCGCGTGGATGCAAGCG 
ValGluLeuAspArgSerAlaGluLeuIleValGlyProLeuProAlaTrpMetGlnAla 

1390 1400 1410 1420 1430 1440 

TATGCACATGATTCGTTTCAGGCGTTCTACCGATC6CCTTTCGGAGCCGTGTCCACAGGA 
TyrAlaHisAspSerPheGlnAlaPheTyrArgSerProPheGlyAlaValSerThrGly 

1450 1460 1470 1480 1490 1500 

ACCaXATCACGCTTOSCXiTGCGGGCTCCGCT^ 

ThrProIleThrLeiiArgLeuArgAlaProLeuSerValLysSerAlaThrLeuArgL^ 

1510 1520 1530 1540 1550 1560 

TGGGGGGCAGCGGATCAGTCAGGCGAGATCGACCTGCCGATGCA6AAGCTCCAAATGTCG 
TrpGlyAlaAlaAspGlnSerGlyGluIleAspLeuProMetGlnLysLeuGlnMetSer 
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1570 1580 1590 1600 1610 1620 

66A6ACGAGTT6GC6CAACAAACCG6C6TGCAG6ACATCAACGACTACAC6TG6TGGACG 
GlyAspGluLeuAlaGlnGlnThrGlyValGlnAspIleAsnAspTyxThrTxpTz^ 

1630 1640 1650 1660 1670 1680 

6TGACX:ATCCCTGCGGCG6ATGTGACCACACCCGGGACGATGT6GTATCA6TTC6T6AC6 
ValThrlleProAlaAlaAspValThrThrProGlyThrMetTxpTyrGlziPh^ 

1690 1700 1710 1720 1730 1740 

6AGAC6GACACTGGCCAG6TGGTCTACTACGATGACAAT6GA6CTCAGCTTGAAGGGCCT 
GluThrAspThrGlyGlnValValTyrTyrAspAspAsnGlyAlaGli^ 

1750 1760 1770 1780 1790 1800 

GGCCAGGTTGG6TTGTCTTCCGACGGACCGAGCTACCAGATCA6CGTATACGAACGGGGA 
GlyGlnValGlyLeuSerSerAspGlyProSei:T3^rGlnIleSerValTyrGliiArgGly 

1810 1820 1830 1840 1850 1860 

TTTCAGAO^CX^GATTGGCTGAAGCACGCaSTGATCTACGAAATCyiTGCCTC 
PheGlnThrPTOAspTrpLeuLysHlsAlaVallleTyrGluIleMetProAspArgPhe 

1870 1880 1890 1900 1910 1920 

TACAATGGCAATATOSCCACGGAGGAGAATCXZGAATAaX:^^ 
TyrAsnGlyAsnlleAlaThrGluGluAsnProAsnThrGlnLysGlylleTyrValGly 

1930 1940 1950 1960 1970 1980 

GCCGATGGAACGGAGTCATTAGGCCCXIATCCAGTTCCACGAGAACTGGGACTCG^ 
AlaAspGlyThrGluSerLeuGlyProIleGlnPheHlsGluAsnTrpAspSerProPro 

1990 2000 2010 2020 2030 2040 

TATGATCCGAATATTCCTCCGTTATCTGATCXCAAAATTGCCAGTCTGaSAGGCAATG 
TyrAspPxtiAsnlleProProLeuSerAspProLysIleAlaSerLeiiA^ 

2050 2060 2070 2080 2090 2100 

CAATG6AACATTGACTTTTTCGGAG6T6ATTTGCAGGGCATCGAGGATAAGCTGGACTAC 
GlnTrpAsnlleAspPhePheGlyGlyAspLeuGlnGlylleGluAspLysLeuAspTyr 

2110 2120 2130 2140 2150 2160 

CTGAAGAGCCTTGGAGTCAATACGCTGTATCTGATGCCCGTCTTTGAGGCGGAATCCAAT 
LeuLysSerLeuGl^^alAsnThrLeuTyrLeuMetProValPheGluAlaGluSerAsn 

2170 2180 2190 2200 2210 2220 

CACAAATATGACACAGCCGACTATTTCAAGATTGACCCTGGATTTGGAACGCAGCAGGAC 
HisLysTyrAspThrAlaAspTyrPheLysIleAspProGlyPheGlyThrGlnGlnAsp 

2230 2240 2250 2260 2270 2280 

TGGCTGAATCTCGTACAGGCTGCGCACGCGAAGGGGTTCCATATCATTCTC^ 
TrpLeuAsnLeuValGlnAlaAlaHisAlaLysGlyPheHisIlelleLeuAspGlyVal 

2290 2300 2310 2320 2330 2340 

TTCGAAGATAOXSGAAGTGACAGCGTATATTTCAACAAGTTCGGGAACTTCCACTCC^ 
PheGluAspThrGlySerAspSerValTyrPheAsnLysPheGlyAsnPheHlsSerAsn 
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2350 2360 2370 2380 2390 2400 

G6TGC6TGGCA6GCGTACCTGAA6AACCA6CCGTCGCTGTC6CCCTACTACTC6TG6TAC 
GlyAlaTrpGlnAlaTyrLeuLysAsnGlnProSerLeuSerProTyrTyrSer^ 

2410 2420 2430 2440 2450 2460 

GTGTGGACAGGGAACACCTCAAACCGATACGATTCGTGGTTTOVGATCGACAOSC^ 
ValTrpThrGlyAsnThrSerAsnProTyrAspSerTrpPheGlnlleAspThrLeuPro 

2470 2480 2490 2500 2510 2520 

CTTACGGACACGTCGAACCCCGCCTATCAGCGATTCGTGTATGGGAGCGACAACTCAGTC 
LeuThrAspThrSerA8nProAlaTy3r61nArgPheValTyr61ySe^^ 

2530 2540 2550 2560 2570 2580 

GCGCGTGTGTGGATCCGGGAAGGTGCGGAO^GATGGCGCTTGGACTCGGCXXSAC^ 
AlaArgValTrpIleArgGluGlyAlaAspGlyTrpArgLeiiAspSerAlaAspAsnGly 

2590 2600 2610 2620 2630 2640 

AATTTC^CACXKXIATGGTGGGGTGGCTTTaSGCAGGC^^ 

AsnPheAsnThrAlaTrpTrpGlyGlyPheAx^GliiAlaValLysSerlleAspProA^ 

2650 2660 2670 2680 2690 2700 

GCAGCGATCATCGGCGAGATCTGGGACAATGCSAaSAATGACAATGGAACGG 
AlaAlallelleGlyGluIleTrpAspAsnAlaThrAsnAspAsnGlyThrAspTrpLeu 

2710 2720 2730 2740 2750 2760 

ACGGGATCGACCTTCGACAGTGXAATGAACTACCAGTTCCGGAACGCCGTGATC6ACTTC 
ThrGlySerThrPheAspSerValMetAsnTyrGlnPheArgAsnAlaValXleAspPhe 

2770 2780 2790 2800 2810 2820 

TTCXXX:6GCACGTACAACGAC6GAAACGTGCAGCACX:ACGCCGT(^ACGCTGCGGGA 
PheAxgGlyThrTyrAsnAspGlyAsnValGlziHlsHlsAlaValAspAlaAla 

2830 2840 2850 2860 2870 2880 

AACCAGGAACTGATGCGCCTGTACAGTGAATATCCTCTGCAGTCGTTCTACTCGATGATG 
AsnGlnGluLexiMetArgLeuTyrSerGluTyrProLeuGlnSerPheTyrSerMetMet 

2890 2900 2910 2920 2930 2940 

AACCTTGTCGATTCGCAAGACACCATGCGGATCCTGACCATCTTGGAGAACGCGCC^^ 
AsnLeuValAspSerGlnAspThrMelJ^glleLeuThrlleLeuGluAsnAlaPro 

2950 2960 2970 2980 2990 3000 

CCAGGCGATCTATCCGCGCTCCAGCAGGATGAGTACAGGCCGTCTCCTGCGGCTGAACAG 
ProGlyAspLeuSerAlaLeuGlnGlnAspGluTyrArgProSerProAlaAlaGluGln 

3010 3020 3030 3040 3050 3060 

TTGGGGATOGAGAGGCTGAAGCTTGTATCGGACTTTCAATTC^CTTCCCGGGCGATa:G 
LeuGlylleGluArgLeuLysLeuValSerAspPheGlnPheSerPheProGlyAspPro 

3070 3080 3090 3100 3110 3120 

ACCATCTTCTACGGCGACGAGGCAGGGCTCACTGGTTATTCGGATCCCCTCAATCGTCGG 
ThrllePheTyrGlyAspGluAlaGlyLeuThrGlyTyrSerAspPTOlaetiAsnAr^ 
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3130 3140 3150 3160 3170 3180 

ACCTATCCGtGGGACAACCAG»ATCTCGATCTCCTGAACa^CTACCGCAAGCTCGGGGCC 
ThrTyrProTrpAspAsnGlnAsnLeuAspLeuLeuAsnHlsTyrArgLysLeuGlyAla 

3190 3200 3210 3220 3230 3240 

ATTCGAAACGCCAATCCTGTGCTTCAGACGGGGGATTTavax:CGTTGTAC6CA(^ 
IleArgAsnAlaAsnProValLeuGlnThrGlyAspPheThrProLeuTyrAlaGlnGly 

3250 3260 3270 3280 3290 3300 

ATGGTGTACXXy^TTTGCAAGGACCaVTTOSGAATGGGCGAGATGTCTTCXIGTGTGC^ 
MetValTyrAlaPheAlaAxgThrlleArgAsnGlyAzgAspValPheGlyValProAla 

3310 3320 3330 3340 3350 3360 

GAGGATGCCACGGCCATTGTGGCXyiTCAAaU^TCAGAACCyVAGCTATCACC^ 
GliiAspAlaThrAlalleValAlalleAsnAsnGlnAsnGlnAlalleThrValThrlle 

3370 3380 3390 3400 3410 3420 

CCGACGGATGGGACGGTTGCGGACGGGTCCACGATGCTCGATGAACTGAACAAC^ 
ProThrAspGlyThrValAlaAspGlySerThrMetLeuAspGluLeuAsnAsnGl^ 

3430 3440 3450 3460 3470 3480 

TACAAGGTGCAGAATGGTGGCATTACACTCACGCTGCAATTCTATCAAGGTGa^ 
TyrLysValGliiAsnGlyGlylleThrLeuTOirLeuGlnSerT^^ 

3490 3500 3510 3520 3530 3540 

GTGACGCCGAGOJACGGGCCGATGGCTTATCTGCAAGAGGAGGATTCl^ 
ValThrProSerAspAlaProMetAlaTyrLeuGlnGluGliaAspSerGlnAsnGl 

3550 3560 3570 3580 3590 3600 

gcgtggacgcctgtgcaaggtgccatcggttatcgcgtctggagaca6aatcx:gaatgga 

AlaTrpThrProValGlnGlyAlalleGlyTyrArgValTrpArgGlnAsnProAsn 

3610 3620 3630 3640 3650 3660 

CAATGGGTGCCCTTTGGACCTGTGCTTCCTGCCACGGACTTGAGTGTCACGGTGGAACGC 
GlnTrpValProPheGlyProValLeiiProAlaThrAspLeiiSerValThrValGluArg 

3670 3680 3690 3700 3710 3720 

GATGCATATGCGCAAACGTTTGCTGTACAAGCGCTGTTTTCGGC6TCT6ATCACGCCCAG 
AspAlaTyrAlaGlnOSirPheAlaValGlnAlaLeiiPheSerAlaSerAspHisAlaGln 

3730 3740 3750 3760 3770 3780 

TCTCCGGTGTCGGCACCTAAGACGGTATCGCTTCCCGTCGATGTGCCCGCGGTACGCCT6 
SerProValSerAlaProLysThrValSerLeuProValAspValProAlaValArgLeu 

3790 3800 3810 3820 3830 3840 

AGTCAGCCGATCGTTAGTG6TCGTGTGGTTG6AGATCGTGC6ATGGTCTCGATCACGCCG 
SerGlnProIleValSerGlyArgValValGlyAspArgAlaMetValSerlleThrPro 

3850 3860 3870 3880 3890 3900 

GTTTCAGGCGCGACGCAGTATGTGATCTACCAGAGACAGGGCGACGGATCGTATGCTCCG 
ValSerGlyAlaThrGlnTyrVallleTyrGlnArgGlnGlyAspGlySerTyrAla 
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3910 3920 3930 3940 3950 3960 

GTCGCGACXKSTCTCaVCAAGTGGCGATTOJGCAGCTATAGGGGAAGTTCCTGCGCAATC 
ValAlaThrValSerThrSerGlyAspSerAlaAlalleGlyGluValProAlaGlnGly 

3970 3980 3990 4000 4010 4020 

CCGGCCaUVCTCXCCTaVCGCGACGATTCGCGTGACAGTGCCCGTACX^ 
ProAlaAsnSerProHlsAlaThrlleArgValThrValProValProAlaGlyPheSer 

4030 4040 4050 4060 4070 4080 

TCGGTGACCTACOXXSTGGCTGCGCAAAACGAAGATGGGCAAGCTGTC 
SerValThxTyrArgValAlaAlaGlnAsnGluAspGlyGlnAlaValTh^ 

4090 4100 4110 4120 4130 4140 

ACa:TATCGCTCTCGAAAAAGTGATGCX;TCGCGAAAAGGGCTATC»GAATTTTTTC^ 
ThrLeuSerLeuSerLysLys 
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